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A record made to be broken 


Japan’s major science-funding agency has a clean record when it comes to research fraud. Now is 
the time for it to step up and resolve a long-running case of alleged scientific misconduct. 


Technology Agency (JST), is either very good at picking honest 

scientists, or does not look very hard for evidence to the con- 
trary. The total number of fraud cases confirmed by the JST since 
its beginnings in 1957 — an organization that delivered ¥87.8 billion 
(US$942 million) in research funding last year — is zero. 

The US National Institutes of Health reports a dozen or so such 
cases every year. Elsewhere in Japan, there has been a spate of high- 
profile cases of fraud in recent years — including that of Yoshitaka 
Fujii, an anaesthesiologist who currently has the highest number of 
retractions by a single author. The JST’s perfect record would seem 
to be against the odds. 

A long-running case typifies the problems. Akihisa Inoue, retired 
president of Tohoku University in Sendai, has been hounded by accu- 
sations of scientific misconduct for years. 

There is no evidence that Inoue has committed any misconduct, 
and he denies manipulating any data. He is one of the world’s leading 
experts in metallic glasses — materials that are more elastic and more 
resistant to corrosion than metals — and has published more than 
2,500 papers on the subject. He has previously told Nature that other 
researchers may simply lack the skills and experience to reproduce 
some of his lab’s results (see Nature 470, 446-447; 2011). 

There is a clear need for the JST to resolve the situation. Questions 
over Inoue’s work were first raised by an anonymous whistle-blower 
nearly six years ago, and the case touches on research going back two 
decades (see Nature 483, 259; 2012). There have been many compli- 
cated twists. Tao Zhang, a co-author with Inoue on several papers, 


Jism major funder of big science, the Japan Science and 


says that the documents and metallic glass samples that could have 
settled the matter were lost when a container carrying his belongings 
back to his home in China fell into the sea. A committee set up by 
Tohoku University in 2007 to determine whether to investigate the 
misconduct allegations was dominated by university administrators 
who had previously been promoted to their posts by Inoue. The com- 
mittee’s conclusion — that there was no need to investigate — led to 
charges of conflict of interest by an economist and other colleagues 
at the university, who raised questions with journal editors. A series 
of corrections have been issued about the relevant papers. 

Last year, a separate committee convened by the JST — which gave 
Inoue ¥1.8 billion for a five-year project starting in 1997, during 
which several of the problematic papers were produced — reported 
that although they couldn't find evidence of misconduct, a closer 
look was justified. So last spring, the JST formally requested that 
the university investigate. Twelve months on, neither the university 
nor the JST would confirm whether an investigation was under way, 
who would be on the investigation committee or when it would start 
and end. 

There is no deadline by which any investigation must report back, 
and if Tohoku University fails to respond, the JST can take no action 
other than to ask again. The university, which might have to return 
some or all of the money if it discovers misconduct, again faces a 
conflict of interest in addition to the embarrassment of an inves- 
tigation. Doing nothing apparently has no consequences. And it is 
one way, now and in the future, to ensure that the JST will keep its 
perfect record. = 


Against the law 


Behaviours proposed for black holes conflict 
with the laws of physics. 


questions that are on the lips of researchers. On page 20,a News 

Feature poses the conundrum: will an astronaut who falls into 

a black hole be crushed or burned to a crisp? The answer, according to 

the disappointing end to the Disney film The Black Hole (Gary Nelson; 

1979) is neither: the astronaut will go on a psychedelic trip through 

a (possibly metaphorical) hell and heaven. The answer according to 
physicists in 2013 is even more ofa let-down: they don’t know. 

The European particle-physics laboratory CERN, located near 

Geneva in Switzerland, has had a mixed experience with black holes: 

various challenges to its Large Hadron Collider over the years have 


Ne let it be said that Nature does not address the science 


focused on the (very small) chance that it could create (very small) 
black holes that would destroy the world. So the CERN press office 
must have looked on nervously as physicists gathered there last month 
to address the mysterious fate of our unfortunate astronaut. There is 
more at stake than the grisly demise of a single free-floating space trav- 
eller. The physics is complicated but the take-home message is this: if 
the astronaut fries, then Einstein’s framework of general relativity goes 
up in smoke with it; if the astronaut is crushed and torn by the black 
hole’s internal variation in gravity, then quantum mechanics is wrong. 

So which is it, relativity or quantum theory; heaven or hell? Debate 
continues. One compromise has the astronaut hang around outside 
the black hole to hoover up some quantum information as it leaks, 
use it to do some maths and then jump in to see if either theory is 
right. So far so good (for the laws of physics if not the astronaut, who 
perishes either way) except that the maths is so difficult that, by the 
time scientists have an answer, the black hole will have evaporated 
beneath them. Now that would make for a better ending, at least for 
the astronaut. Disney, which is said to be remaking its film, should 
take note. m 
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university to announce the Breakthrough Prize in Life Sciences. As 

someone who hopes for a career in biomedical research, I applaud 
these awards. Biomedical science, when practised correctly, is a well- 
spring of knowledge, innovation and human health. But is it practised 
correctly? And if not, how does it need to change? 

In recent years, it has become clear that biomedical science is 
plagued by findings that cannot be reproduced. This wastes grant 
money and hinders development of new treatments and cures. The 
irreproducibility epidemic is exacerbated by a funding structure that 
rewards publications above all else, whether they are reproducible or 
not. Science as a system should place more importance on reproduc- 
ibility. Not every paper needs to be medically relevant, but at the very 
least they should all be reproducible. Reproduc- 
ibility separates science from mere anecdote. 

Some journals already offer to publish rep- 
lication studies, and there are nascent projects 
aimed at reproducing work in individual fields 
from disease to psychology. But these are stop- 
gap solutions. A more comprehensive answer 
is required. 

As a start, funding agencies should tie grant 
funding to replication. A portion of their 
budget could be set aside to pay for independ- 
ent replication studies. If a paper cannot be 
replicated, the authors should be required to 
amend or retract it. Funding agencies would 
then consider a principal investigator’s history 
of reproducibility in grant reviews. This key 
reform would ensure that career success rests 
on the validity of findings rather than whether they are published in 
trendy journals. 

Many scientists will object to this proposal, and for many reasons. 

It sounds complicated. Scientists would apply for grants, conduct 
research and publish their findings much as they do now. The only 
difference is that replication studies would be more plentiful and more 
would be published. Each replication would be electronically linked 
to the original paper and boost its validity. Studies that could not be 
reproduced would be marked as such. Not all non-reproducible work 
is flawed, of course, but a red flag against a paper, accompanied by an 
amendment from the original authors noting discrepancies in meth- 
odology and analysis that might explain why it was not reproduced, 
would become a normal part of scientific discourse. Science should 
work through open dialogue, accessible to everyone. 

It would slow the dissemination of novel 
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If a job 1s worth doing, 
it is worth doing twice 


Researchers and funding agencies need to put a premium on ensuring that 
results are reproducible, argues Jonathan F. Russell. 


are already doing them. Under my proposal, more of these studies 
would be published. More people will be needed, and if granting agen- 
cies provided funds, more scientists would do replication studies. 

Replication studies are hard. Sharing reagents and methods is not that 
difficult, and it is already mandated by funding agencies. Unfortunately, 
many do not comply. My proposal would encourage original authors to 
work with replicating groups and make the studies easier, because each 
replication would bolster the original authors’ reputation. 

Publishing amendments would take a lot of time. It would certainly 
take more time than is currently spent on it, but scientists already 
have similar conversations by e-mail, at conferences and in lab meet- 
ings. If publishing amendments were explicitly rewarded, people 
would take the time to do it. 

Science already self-corrects. True, but the 
timescale is long and the associated waste is 
vast. Even outright fraud can take decades to 
come to light, and negative results rarely come 
to light at all. Under my proposal, negative 
results would be valued and valuable. 

But there is no money available. This reform 
should save money by redirecting spending 
towards science that is reproducible. Society will 
receive a higher return on its investment in the 
form of treatments and cures. 

This proposal is too radical. The same was 
once said about open-access publication. Yet 
that is quickly becoming the norm thanks to 
policy changes by funding agencies. 

The US National Institutes of Health and other 
granting agencies will never adopt this policy. 
National governments, which fund granting agencies, could impose 
the reform and ensure that agencies adopt it. 

Acknowledging irreproducibility in science undermines public trust. 
Trust is already being undermined. It is best that we reverse the tide 
and restore that trust by practising transparency. 

It would discourage high-risk, high-reward science, which is less likely 
to be reproducible. There would be no requirement that a study be rep- 
licated before publication. Ifa risky paper ends up being reproducible, 
everybody wins. But if repeated attempts to reproduce it fail, itis nota 
valid finding — there is no reward to society. Why pretend otherwise? 

In 1953, then US president Dwight Eisenhower said that every dol- 
lar spent on war “signifies, in the final sense, a theft from those who 
hunger and are not fed, those who are cold and not clothed”. The same 
is true for every dollar spent on science that cannot be replicated. We 
can do better. = 


Jonathan FE. Russell recently defended his PhD and is completing 
medical school at the University of California, San Francisco. 
e-mail: jonathan.russell@ucsf.edu 
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| NEUROSCIENCE 
Faulty link in 
schizophrenia 


The impaired learning and 
reasoning skills seen in 
people with schizophrenia 
may be caused by defective 
connectivity between two 
brain regions. 

Joshua Gordon and 
Christoph Kellendonk 
at Columbia University 
in New York and their 
colleagues used mice to 
mimic the abnormal brain 
activation seen in people with 
schizophrenia when they 
perform cognitive tasks. 

The researchers genetically 
engineered mice so that 
a synthetic compound 
would reversibly inhibit 
neurons in the mediodorsal 
thalamus. They found that 
even a slight reduction in 
activity of this brain region 
altered connectivity with the 
prefrontal cortex and caused 
poor cognition. 

This implicates the altered 
brain patterns in humans with 
the disease as a contributor to 
rather than a consequence of 
cognitive deficits, the authors 
say. 

Neuron 77, 1151-1162 
(2013) 


Russian-doll-style 
droplets 


Complex emulsions, such as 
spheres of oil and water nested 
inside each other, can be 
assembled in one step using a 
simple device, report Chang- 
Soo Lee at Chungnam National 


Selections from the 
scientific literature 


ANIMAL COMMUNICATION 


Bees of bigger hives forage better 


Larger honeybee colonies benefit from their 
greater capacity to gather information. 

Matina Donaldson-Matasci at the University 
of Arizona in Tucson and her team prevented 
honeybees (Apis mellifera) from communicating 
through their waggle dance (pictured), and 
monitored what happened in different-sized 
colonies. Bees in large colonies in which normal 
communication occurred were the most efficient 


University in Daejeon, South 
Korea, and his colleagues. 
The authors dispersed a 
water-based solution into a 
tiny chamber that opened into 
an oil-filled microchannel. 
This microfludic device 
caused the droplets to 
form double, triple and 
even quadruple (pictured) 
emulsions, depending on the 
concentration of the solvent. 
By adding detergent, the 
researchers could create even 
more conformations. 
These emulsions 
could be used to make 
functional materials, such 
as multichambered drug 
capsules, the authors say. 
Adv. Mater. http://dx.doi. 
org/10.1002/adma.201204657 
(2013) 
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Chemically 
camouflaged fish 


In the dark and quiet of 
freshwater ponds, a fish may 
use a cloak of chemicals to 
prowl for prey. 

William Resetarits at Texas 
Tech University in Lubbock 
and Christopher Binckley 
at Arcadia University in 
Glenside, Pennsylvania, set 
up mock ponds and recorded 
how several species of 
predatory fish affected whether 
aquatic beetles moved in, or 
frogs laid eggs. The authors 
placed the fish in screened 
chambers that concealed them 
visually, but not chemically, 
and measured colonization 
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food-finders, sending new foragers to known 
resources up to four hours earlier than smaller 
colonies or large colonies in which waggle dances 
did not convey information. 

This work provides some of the first 
experimental evidence that communication is 
particularly beneficial to large groups of social 
insects, the authors say. 

Anim. Behav. 85, 585-592 (2013) 


of the ponds by more than 

a dozen prey species. The 
prey avoided ponds that 
contained fish such as bluegill 
(Lepomis macrochirus) and 
the bluespotted sunfish 
(Enneacanthus gloriosus), but 
ponds containing pirate perch 
(Aphredoderus sayanus) — a 
particularly rapacious beetle 
muncher — were almost as 
popular as ponds without fish, 
suggesting that this fish masks 
its scent. 

Although chemical 
deception has been reported 
for specific predator-prey 
pairs, this perch may be the 
first example of generalized 
chemical camouflage, the 
authors say. 

Am. Nat. http://dx.doi. 
org/10.1086/670016 (2013) 
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QUANTUM INFORMATION 


Quality photons 
from nanocrystals 


Photons that are suitable for 
quantum communication can 
be emitted from solid materials. 

A roadblock to building 
quantum information 
networks is the need for 
streams of identical photons 
that oscillate in sync with 
each other. Such streams 
are produced by lasers, 
but integrating these with 
electronics is difficult. 

Now Mete Atatiire at the 
University of Cambridge, UK, 
and his colleagues report that 
they have generated such a 
stream from semiconducting 
nanocrystals, or quantum 
dots. These dots exhibit a 
phenomenon called resonance 
fluorescence, in which emitted 
photons resonate with one 
another, and so oscillate in 
syne. 

Nature Commun. 4, 1600 (2013) 


MICROBIOLOGY 


Whole sequence, 
no culture 


The development of a 
culture-free method for 
sequencing the genomes of 
chlamydia-causing bacteria 
means that researchers can 
obtain genomes directly from 
clinical samples and skip 

the tricky, time-consuming 
process required to grow these 
bacteria. 

Helena Seth-Smith and 
Nicholas Thomson at the 
Wellcome Trust Sanger 
Institute in Hinxton, UK, 
and their colleagues purified 
Chlamydia trachomatis 
(pictured) from genital swabs 
using antibody-encrusted 
magnetic beads that latch onto 
C. trachomatis cells — leaving 


behind contaminants such 

as human cells and other 
microbes. Bacterial DNA 

was then copied, sequenced 
and assembled using 
bioinformatic software; this 
yielded five complete genome 
sequences from 18 samples 
tested. 

The technique could also 
be used to sequence other 
difficult-to-culture pathogens, 
the authors say. 

Genome Res. http://dx.doi. 
org/10.1101/gr.150037.112 
(2013) 


| ___NEUROSCIENCE 
Fickle wiring in 
ageing brains 


The decline of cognitive 
ability seen in some elderly 
people may not result 
from fewer brain cells, but 
from a rise in capricious 
connections. 

Vincenzo De Paola at 
Imperial College London, Sen 
Song of Tsinghua University 
in Beijing and their colleagues 
observed the cell connections 
that shoot signals through 
the brain by fitting glass 
windows into the craniums 
of old and young mice. 
Although the researchers 
expected older mice to have 
fewer firing points, they found 
more. But these connections 
were erratic. In older mice, 
the connections had high 
turnover rates, and wavering 
firing strength. The older 
mice also performed less well 
ona memory test. 

The researchers suggest that 
mental decline seen in ageing 
mice is due to disorderly 
wiring. 

Proc. Natl Acad. Sci. USA http:// 
dx.doi.org/10.1073/pnas. 
1218731110 (2013) 


Symmetry 
affirmed 


An unprecedented 
measurement of an 
antiproton suggests that 
matter and antimatter truly 
mirror each other. 

Gerald Gabrielse at Harvard 
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The most viewed 
papers in science 


CHOICE 


Making films on the edge 


A change in the orientation of molecular 
sheets in layered films may open up new 
uses for these materials. 

Thin films are usually grown by stacking 
atoms horizontally, like sheets in a pad of paper. But Yi Cui 
and his colleagues at Stanford University in California have 
grown layers vertically, like dominoes balanced on their edges. 
The researchers started with strips of molybdenum and added 
sulphur or selenium atoms, which diffuse down through the 
molybdenum, carving nanometre-sized stripes. 

The molybdenum atoms are sandwiched between sulphur 
or selenium atoms so that the top of the structure is made up of 
exposed edges. These sheet-edge structures are reactive, and so 
might make good catalysts, the authors say. As an example, they 
show that a reaction to produce hydrogen gas proceeds more 


€> HIGHLY READ 
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rapidly with a higher concentration of sheet edges. 


Nano Lett. 13, 1341-1347 (2013) 


University in Cambridge, 
Massachusetts, and his group 
began with a particle trap 
designed to measure the 
proton’s magnetic moment 

— the amount of force that 

its magnetic spin can exert. 
They shipped the equipment 
to the world’s largest particle- 
physics laboratory — CERN 
near Geneva, Switzerland 

— where they confined a 
single antiproton from the 
CERN beam and measured 

the strength of its magnetic 
moment. The moment of the 
antiproton differed from that of 
the proton by less than 5 parts 
per million, the most precise 
measurement ever achieved. 
This result is in agreement with 
established theory of symmetry 
between particles and their 
antimatter counterparts. 

Phys. Rev. Lett. 110, 130801 
(2013) 


APPLIED MATHS 


Mechanics behind 
seashell spines 


Spines on seashells, which 
offer protection from the 
crushing jaws of potential 
predators, are often cited 
as an example of similar 
features that have evolved 


independently in different 
species. Now, their shape can 
be explained by the physics of 
shell formation. 

Derek Moulton at the 
University of Oxford, UK, 
and his colleagues modelled 
the interactions between 
the hard edge of the shell 
and the mollusc mantle that 
secretes new shell material 
during growth. Variations in 
growth rates and the stiffness 
of new shell could account 
for the diversity of spine 
shapes found on the shells of 
different species in sea-snail 
families such as Muricidae 
(pictured). 

A mechanical model could 
explain the similarities in 
shell spines seen across 
various species, the authors 
suggest. 

Proc. Natl Acad. Sci. USA 
http://dx.doi.org/10.1073/ 
pnas.1220443110 (2013) 
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SEVEN DAYS nse: 


Desert treaty woes 


Canada withdrew from the 
United Nations Convention 
to Combat Desertification 
(UNCCD) on 28 March, 
leaving 193 countries party 

to the legally binding treaty. 
Canada is due to pay around 
US$310,000 to the convention 
in 2013 — just over 3% of 

the convention's budget. The 
nation will honour its 2013 
financial commitments, say 
media reports. The move 
comes ahead of conferences to 
be held under the aegis of the 
convention this month; where 
the first ever cost-benefit 
analysis of desertification and 
sustainable land management 
will be presented. 


UK open access 

Rules making publicly 

funded research in the United 
Kingdom free to read came 
into force on 1 April. This 
year, universities and research 
institutions are expected to 
openly publish at least 45% 

of any research funded with 
grants from the UK research 
councils. Results will be made 
available either immediately or 
after a delay of between 6 and 
24 months, depending on the 
research subject. This policy 

is due for review in 2014. See 
nature.com/scipublishing for 
Nature’s special on open access 
and the future of publishing. 


China spring-clean 
Beijing is spending 100 billion 
renminbi (US$16 billion) on 
a three-year effort to clean up 
the city’s air, land and water. 
The plan, announced on 

28 March, includes measures 
such as shutting down illegal 
construction sites, setting up 
water-recycling plants and 
establishing an emergency 
headquarters to deal with 
extreme air-quality problems. 
Shanghai launched its 

own air-pollution plan on 


US waterways in bad 


More than half of US rivers and streams are in 
a poor environmental condition, according 

to a survey published by the Environmental 
Protection Agency on 26 March. The data 
from 2008-09 — the most recent available — 
show that 28% of the nation’s waterways have 
excessive levels of nitrogen, and 40% have high 
levels of phosphorus. This nutrient pollution 


1 April, providing emergency 
measures for when air quality 
declines below a set target. 
These include halting building 
projects and swapping coal 

for natural-gas in power 
generation. 


Climate change 

More than 80% of Americans 
believe that the planet is 
warming and think that 
coastal communities should 
be better prepared for rising 
seas and stronger storms, a 
survey published on 28 March 
has found. However, the poll, 
commissioned by scientists 

at Stanford University in 
California, also found that the 
US public does not want the 
government to pay for climate- 
change adaptation. Most of 
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the 1,174 people surveyed 
said that those living in at-risk 
areas should bear the costs of 
making their communities 
more resilient. See go.nature. 
com/z23dot for more. 


HeLa-cell debate 


An ethical controversy 
erupted last week over 
whether to publish genetic 
data on old tissue samples 
that were originally 
established without their 
donor’s consent. Researchers 
hurriedly removed from 
public databases their recently 
published genome of the 
world’s most famous cell line, 
HeLa, which was established 
without consent from 
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shape 


causes algal blooms (pictured; near the Iron 
Gate dam on the Klamath River in northern 
California), the breakdown of which saps 
water of the oxygen that aquatic life needs 
to survive. “This is the most dire in a string 
of water assessments over the last 20 years,” 
says Earthjustice, an environmental law 
organization in San Francisco, California. 


Henrietta Lacks around 1951 
(when consent was not legally 
required). Lacks’s descendants, 
together with bioethicists 

and other scientists, said that 
it could reveal information 
about traits carried by living 
family members — although 
other genetic data about HeLa 
cells remain available online. 
See go.nature.com/ymiwgi 
for more. 


Deep-sea dive 


The film-maker James 
Cameron is donating his 
deep-sea submersible to the 
Woods Hole Oceanographic 
Institution in Massachusetts, 
he announced on 26 March. 
Cameron used the craft, the 


AURORA PHOTOS/ALAMY 


@ DEEPSEA CHALLENGER, 

2 to seta record last year for 

the deepest single-person 

dive when he travelled to 

2 the bottom of the Mariana 

= Trench, reaching a depth of 
10,900 metres. The DEEPSEA 
CHALLENGER is the only 
submersible capable of 
carrying a human deeper than 
7,000 metres. See go.nature. 
com/hhsks8 for more. 


Bird flu deaths 


Two men have died after being 
infected with a type of bird flu 
never before seen in humans, 
China reported on 31 March. 
The men, aged 27 and 87 and 
from Shanghai, contracted the 
H7N9 virus in February and 
died within 8 days. A 35-year- 
old woman from Anhui 
province, who contracted 

the virus in March, was ina 
critical condition as Nature 
went to press. See go.nature. 
com/vrjulj for more. 


YOSHI CHIB. 


Locust plague 

The worst locust plague to 

hit Madagascar in decades 
prompted the United Nations 
on 26 March to call for 

US$41 million in funding 
from donors to implement 
immediate emergency and 
longer-term control measures. 
Around half the country is 
already affected by the pests 
(pictured), which threaten 
large swaths of the island’s rice 


sea ice in the Arctic last 


bya record refreezing of 
uncovered ocean surface, 


on 15 March that is still the 
sixth-lowest recorded since 


TREND WATCH 


A record low in the extent of 


September has been followed 


resulting in a winter maximum 


SOURCE: US NATIONAL SNOW AND ICE DATA CENTER 


satellite measurements began 
in 1979. The US National Snow 
and Ice Data Center in Boulder, 
Colorado, said that the data 
indicate “a more pronounced 
seasonal cycle” and “the 
increasing dominance of first- 
year ice in the Arctic’. 


production and could thrust 
more than 10 million residents 
into hunger. Called-for 
control efforts include aerial 
spraying of some 1.5 million 
hectares and improved aerial 
and ground surveillance to 
monitor the threat. 


| BUSINESS 
Patent blocked 


India’s Supreme Court ruled on 
1 April against an attempt by 
Swiss drug company Novartis 
to patent the anticancer drug 
Gleevec (imatinib mesylate). 
India’s patent law limits the 
ability of drug firms to make 
minor modifications to drugs 
to extend their patent life, a 
tactic known as evergreening. 
The patent claim by Basel- 
based Novartis on a modified 
version of Gleevec “fails in 
both the tests of invention 

and patentability’, India’s high 
court said. The nation’s support 
for cheap generic drugs has 
reduced prices, notably those 


ARCTIC ICE MAXIMUM 


ofanti-HIV medicines. But 
drug companies complain 

that it weakens intellectual- 
property rights and stifles 
innovation. See go.nature.com/ 
csqesi for more. 


GM crop deal 

Two agricultural 
biotechnology giants agreed 
on 26 March to settle a lengthy 
legal battle over patent rights 
to ‘next-generation genetically 
modified (GM) soya beans. 
DuPont, based in Wilmington, 
Delaware, will pay Monsanto, 
based in St Louis, Missouri, at 
least US$1.75 billion over the 
next decade for the right to 
offer two herbicide-tolerant 
lines of soya bean. Monsanto 
will gain access to some 
DuPont patents covering 
disease resistance and maize 
(corn) defoliation. 


Diabetes drug 


US regulators have given the 
green light to the first ina new 
class of drugs to treat diabetes. 
On 29 March, the Food 

and Drug Administration 
approved canagliflozin 
(Invokana), which was 
developed by Johnson & 
Johnson, headquartered in 
New Brunswick, New Jersey. 
Canagliflozin blocks sugar 
reabsorption in the kidneys by 
inhibiting the sodium/glucose 
cotransporter 2 (SGLT2) 
protein, increasing the amount 
of sugar patients excrete in 
their urine. Companies have 


Sea-ice extent in the Arctic has reached its sixth-lowest 


maximum in the satellite record. 
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SEVEN DAYS | THIS WEEK | 


6-10 APRIL 
Discussions on mapping 
the epigenomes of 
cancer take place at the 
American Association 
for Cancer Research's 
annual meeting in 
Washington DC. 
go.nature.com/wiozhm 


7-11 APRIL 

Plenary talks about the 
sustainability of the 
world’s food system 
feature at the American 
Chemical Society's 
Spring Meeting in New 
Orleans, Louisiana. 
go.nature.com/xgylsf 


8-19 APRIL 

The United Nations 
Forum on Forests meets 
in Istanbul, Turkey; it 
will discuss progress 

on a 2007 commitment 
to manage forests 
sustainably. 
go.nature.com/potx5i 


been racing to bring an SGLT2 
inhibitor to market; last year 
regulators shot down another 
such drug, dapagliflozin, 
citing cancer concerns. 


PEOPLE 


NICE head departs 


The head of the National 
Institute for Health and Care 
(formerly Clinical) Excellence 
(NICE) in London stepped 
down on 31 March. Michael 
Rawlins has led the body, 
which is tasked with assessing 
the cost-effectiveness of 
medicines, since 1999. The 
government confirmed last 
month that NICE will havea 
central role in a new pricing 
scheme for medicines under 
which, from 2014, drug 
companies will be paid what 
their products are deemed 

to be worth. For more see 
go.nature.com/37|su2. 
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Francis Crick’s co-discovery of the structure of DNA laid the foundations of molecular biology. 


SCIENCE HISTORY 


Crick’s medal goes 
under the hammer 


Pending auction raises eyebrows but few objections. 


BY EWEN CALLAWAY 


efore he and James Watson made the 
B defining discovery of the twentieth 
century, Francis Crick was an impover- 
ished PhD student. The future Nobel laureate 
twice left his typewriter with a pawnbroker 
in exchange for a loan, and his family helped 
Crick and his wife Odile to make ends meet. 
Now he is — posthumously — repaying that 
generosity. Crick’s family stands to gain more 
than a million dollars after his Nobel medal 
and other memorabilia go to the auction block 
next week in New York. The sale has been met 
with speculation and curiosity, but also with 
some shrugs. Historians are less interested in 


the glittery prize that Crick won six decades 
ago than in some faded bits of paper that are 
also up for bidding: a letter from Crick to his 
son describing the landmark 1953 discovery. 
Public attention, however, has focused on 
Crick’s Nobel medal since its 11 April sale was 
announced. The 23-carat-gold medallion was 
awarded for the 1962 Nobel Prize in Physiol- 
ogy or Medicine, which Crick shared with 
Watson and Maurice Wilkins for their work on 
determining the molec- 
ular structure of DNA 


and its role in heredity.  Formore onCrick’s 
Crick put the medalina memorabilia and the 
family trust in 1990. full letter to his son: 


Heritage Auctions, a 
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firm based in Dallas, Texas, that specializes 
in coins and other historical memorabilia, 
estimates that the medal will fetch more than 
US$500,000. The company is also selling an 
endorsed cheque in Swedish kronor awarded 
for the prize; a lab coat perplexingly adorned 
with a single — not double — golden helix; 
several of Crick’s books; and his gardening 
journal and nautical logbook. (He was an avid 
gardener and used his Nobel money to buy a 
half-share in a 14-metre yacht.) 

One day before this sale, auctioneer 
Christie's will sell the seven-page, handwritten 
letter that Crick sent to his 12-year-old son 
Michael in 1953 describing the structure of 
DNA. “Jim Watson and I have probably made 
a most important discovery,’ begins the letter, 
which includes a sketch of the double-helical 
structure and Crick’s inference on how the 
molecule replicates. “You can understand that 
we are very excited.” 

Christie’s estimates that the letter will fetch 
between $1 million and $2 million. “Tt’s ahome- 
run kind of letter,’ says Francis Wahlgren, 
international head of rare books and manu- 
scripts at the auction house. In assessing the 
letter’s worth, he cites its scientific impor- 
tance — the letter is one of the first written 
descriptions of DNA’s structure, preceding its 
scientific publication by weeks — as well as 
its personal sentiment. Half of the letter’s sale 
money will be donated to the Salk Institute for 
Biological Studies in La Jolla, California, where 
Crick ended his career. And at least 20% of the 
money from the sale of the medal and the other 
items will benefit the Francis Crick Institute, 
a biomedical centre under construction in 
Central London. 

Kindra Crick, a granddaughter of the laureate 
who is an artist in Portland, Oregon, says that 
the medal has been in storage for decades, 
and that her family hopes a buyer will make 
it available for public display. “It’s challenging 
to know what to do with something like this,” 
she says. “We pretty much figured that the 
types of people who would want to bid on this 
unique part of science history would have the 
resources to make it available for public display 
and not keep it locked up” 

“That is not at all guaranteed,” says Soraya 
de Chadarevian, a science historian at the Uni- 
versity of California, Los Angeles. Still, she sees 
the medal as more of a collector’s item than an 
important artefact from the dawn of molecu- 
lar biology. The letter to Crick’s son would 
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> beof much more interest to historians, 
she says, because it contains an early sketch 
of DNA. That, she adds, “would be much 
better placed in an archive”. 

In the early 2000s, Crick nearly sold his 
papers to a private buyer. Watson urged 
him to reconsider the sale, and in 2001, 
the Wellcome Trust, a London-based 
biomedical charity, purchased the papers 
for £1.6 million (US$2.4 million). The 
Wellcome Library recently launched a 
website containing digitized versions of the 
collection. 

Simon Chaplin, director of the Wellcome 
Library, says that the Crick memorabilia 
now at auction are a “low priority” for Well- 
come. “I think it would be nice ifthe medal 
could be available to the public, but I feel it’s 
more important to present the substance of 
somebody’s research than to focus on indi- 
vidual items that may be intrinsically valu- 
able,” he says. He suggests that the medal 
and other items would be better suited 
to London's Science Museum: its collec- 
tion includes a reconstruction of Watson and 
Crick’s first model of the DNA double helix, 
complete with some pieces from the original 
model, which was destroyed. 

Robert Bud, the museum's principal cura- 
tor of medicine, says “it’s our business whether 


“A most important discovery,” wrote Crick to his son. 


we're going to bid for it’, but that “if somebody 
were to give it to the Science Museum I would 
doubt we would reject it” Museums, adds Bud, 
can make scientific memorabilia more worth- 
while by presenting objects within a larger 
story. “It makes the past real, and enables you 


to believe that the future with which youre 
working has a real ancestry,’ he says. 

It is uncommon — although not unheard 
of — for scientific collectables to wind up 
at auction, adds Bud. In 1997, his museum 
bought a preserved penicillin mould, pre- 
pared by Alexander Fleming, for around 
$25,000 from Christie’s. And in November 
2012, a Danish auction house garnered 
280,000 Danish kroner (US$48,000) for 
the Nobel medal that Aage Niels Bohr 
won in 1975 for his theoretical work on 
the structure of the atomic nucleus. His 
father Niels Bohr, a pioneer of the field of 
quantum physics, and August Krogh, the 
1920 medicine laureate, auctioned their 
medals in 1940 to raise money for a fund to 
aid the Finnish people during the Second 
World War. An anonymous buyer has since 
given the medals to the Danish Historical 
Museum in Fredriksborg. 

Neuroscientist Charles Stevens, a former 
colleague of Crick’s at the Salk Institute, isn’t 
bothered by the family’s decision to auction 

the medal. “The things that I would treasure 
from him are things he actually did,” he says. 
Nor does he think that Crick would mind. 
“Francis didn’t make the medal — he just won 
it. ’'m sure he was glad he got it, but I doubt he 
would be very sentimental.” = 
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BIOMEDICINE 


Sizing up a slow 
assault on cancer 


Rise of immunotherapies spurs search for markers of response. 


BY HEIDI LEDFORD 


into the examination room to deliver bad 

news to his patient. Scans showed that the 
man’s advanced melanoma had spread, and 
new tumours had sprouted, even though he 
had received an experimental therapy called 
ipilimumab (Yervoy) to rally his immune sys- 
tem against the disease. “In my mind I was 
rehearsing the standard speech,’ says Wolchok, 
an oncologist at the Memorial Sloan-Kettering 
Cancer Center in New York. “Tm sorry Mr So- 
and-so, but the treatment doesn’t seem to be 
working:” Instead, the patient stopped Wolchok 
at the door. “Now before you show me your pic- 
tures, let me tell you: I feel better,” he said. 

Wolchok took a gamble, and continued the 
therapy. More than six years later, his patient is 
thriving, and the drug, made by Bristol-Myers 


Jes Wolchok braced himself as he walked 
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Squibb of New York, has been approved by 
the US Food and Drug Administration. In 
that time, other experimental immunothera- 
pies have shown promise, and clinicians have 
become increasingly familiar with delayed 
responses such as the one Wolchok’s patient 
experienced — which reflect a gradual attack 
by the immune system, in contrast to the rapid 
onslaught of toxic chemotherapies. They still 
have no tools to quickly discern which patients 
are benefiting from treatment, but that may be 
about to change. 

At the annual meeting of the American Asso- 
ciation for Cancer Research in Washington DC 
this week, Wolchok and other researchers will 
report on their search for immunotherapy 
markers — ways to predict a patient’s response 
to an immunotherapy or to show whether a 
given treatment is working. The work is ham- 
pered by the complexity of the immune system, 
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but early results are converging on one point: 
that patients’ own immune responses to cancer 
are crucial in determining outcomes. 

“When you sum it all up in many differ- 
ent cancers, early and late stage, it looks like 
patients who have a ‘smouldering? initial 
immune response to cancer are more likely to 
benefit from immune therapies,” says Thomas 
Gajewski, a cancer researcher at the University 
of Chicago in Illinois. Early work suggests that 
such patients may fare better, irrespective of 
the therapy used to treat them, he says. The 
evidence is fuelling a push to add an immune 
component to standard tumour pathol- 
ogy evaluations (see “T cells can brighten 

prognosis’). 


“It looks Reliably gauging 
like patients the initial response, 
who have a however, is a chal- 
‘smouldering lenge, cautions Mario 
initialresponse —Sznol, an oncolo- 
to cancer are gist at Yale Univer- 
more likely to sity in New Haven, 


Connecticut. The 
immune response 
is a moving target, influenced not only by a 
patient’s genetic make-up and environment, 
but also by proteins secreted by the tumour 
that can suppress immune cells. Immune 
markers present in the blood may not reflect 
what is happening in the tumour, and variation 
within a tumour means that biopsies may not 
paint a full picture either, if the needle misses 


benefit.” 


OBSTETRICS & GYNAECOLOGY/SPL 
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CANCER STAGES 


T cells can brighten prognosis 


For more than half a century, pathologists 
have assessed the severity of a cancer using 
the same basic framework: tumour size, 
the cancer’s presence in lymph nodes and 
whether it has spread to other parts of the 
body. Called ‘TNM staging’, the framework 
ranges from stage 1, which has a relatively 
good prognosis, to stage 4, which can 
require immediate and aggressive therapy. 
But researchers are now wondering 
whether it is time to add an ‘i’ category to 
the test — by factoring in whether immune 
cells are present in or around the tumour. In 
2006, Jér6me Galon of the National Institute 
of Health and Medical Research in Paris 
and his colleagues found that the presence 
of T cells in or around colon cancers was 
associated with better patient outcomes’. 
They then applied the screen to samples 
from 599 patients and found that the 
immune response predicted outcomes more 


any patches of immune cells that have infil- 
trated the tumour. 

Some are hoping that new methods will 
fill the gap. Adaptive Biotechnologies, based 
in Seattle, Washington, uses intensive DNA 
sequencing of blood and tissue samples to 
profile the changes in specialized receptors 
found in populations of immune cells called 
T cells. These cells make the receptors by shuf- 
fling variable regions of DNA in response to 
antigen exposure. Chief executive Chad Rob- 
ins says that the company hopes the assay will 
be sensitive enough that blood samples alone 
could be used to monitor patients’ responses 
to immunotherapy. Demand for the tech- 
nique has skyrocketed in the past year, he says, 
as more and more companies seek to test its 
potential to guide immunotherapy decisions. 

Suzanne Topalian, an oncologist at Johns 
Hopkins University in Baltimore, Maryland, 
focuses instead on the cancer cells. A number of 
the world’s biggest pharmaceutical companies 
are developing therapies that release a brake 
on the immune system by inhibiting a pro- 
tein called programmed death 1 (PD-1) or its 


accurately than the TNM staging system®. 

The results have spawned an effort to test 
the ‘immunoscore’ approach at 23 sites in 
17 countries. Researchers will mine stored 
specimens and associated hospital records 
for 6,000 colon-cancer patients. Results are 
expected in the autumn. Bernard Fox, who 
studies immunotherapies at Oregon Health & 
Science University in Portland, hopes that the 
test will also work for other cancers in which 
T cells can infiltrate tumours, such as breast 
and ovarian cancer. But he worries that 
biopsies for these other cancers are often not 
as large — in colon cancer, the entire tumour 
plus surrounding tissue is typically taken. 

Yet if the colon-cancer results hold up 
at scale, it could prompta radical change 
in thinking, he says. “In the current staging 
system, pathologists consider immune cells 
a contaminant,” says Fox. “Isn’t it incredible 
that they may be the key to survival?” HLL. 


partner, programmed death ligand 1 (PD-L1). 
A study by Topalian and her colleagues’ sug- 
gests that these therapies work best if at least 
5% ofa patient’s biopsied tumour cells express 
PD-L1. Some companies have already made 
PD-L1 expression a requirement for partici- 
pation in their clinical trials — a move that 
both Topalian and Wolchok worry is prema- 
ture. “I certainly would not want to exclude a 
patient from a trial for a PD-1-pathway block- 
ing drug,” says Wolchok. “Even if one tumour 
biopsy is negative, the patient may have other 
tumours that are positive.” 

While Topalian and others hunt for use- 
ful immunotherapy markers, academic labs 
are trying to understand what ignites the 
smouldering immune response to cancer in 
the first place. “Why do some patients have 
this response when others don’t?” asks Sznol. 
“Understanding this could lead us to new 
therapies.” m 


1. Topalian, S. L. et al. N. Engl. J. Med. 366, 2443-2454 
(2012). 

2. Galon, J. et al. Science 313, 1960-1964 (2006). 

3. Mlecnik, B. et al. J. Clin. Oncol. 29, 610-618 (2011). 
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UNDER THE SEA 


Researchers want to put a geoneutrino detector 
on the ocean floor to avoid confusing signals 
from nuclear reactors and Earth’s crust. 
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TNU: Terrestrial neutrino units. 1 TNU corresponds to 
1 geoneutrino event recorded over a year-long fully efficient 
exposure to 10° free protons. 


Detectors zero in 
on Earth’s heat 


Geoneutrinos paint picture of deep-mantle processes. 


BY ALEXANDRA WITZE 


window on the deep Earth opened 
Ainezeceat in 2011, when Japan's 
nuclear reactors were shut down after 
the Fukushima disaster. Before the closure, an 
underground particle detector called Kam- 
LAND based in Kamioka, Japan, was moni- 
toring a torrent of neutrinos streaming from 
dozens of nearby nuclear reactors, seeking 
clues to the nature of these hard-to-catch sub- 
atomic particles. After those plants fell silent, 
KamLAND scientists could see more clearly a 
signal that had largely been obscured: a faint 
trickle of neutrinos produced inside the planet. 
Neutrinos are generated in stars, reactors, 
and deep in Earth’s crust and mantle by 
the radioactive decay of elements such as 
uranium and thorium. KamLAND reported 
the first tentative detections of these ‘geo- 
neutrinos in 2005 (ref. 1). But last month at 
aconference in Takayama, Japan, KamLAND 
scientists reported seeing them in meaning- 
ful quantities — as did a team at the Borexino 
neutrino detector at the Gran Sasso National 
Laboratory near L- Aquila, Italy. 
These detections are not just curiosities. 
Geoneutrinos offer the only way to measure 
one of Earth’s internal heat sources. The total 


heat flow, measured with sensors in deep 
mines and amounting to 47 terawatts (TW) 
of power, drives everything from plate tecton- 
ics to Earth's magnetic field. Some of it comes 
from the decay of radioactive elements, the rest 
is primordial heat left over from when Earth 
was formed by the violent collision of plan- 
etary building blocks. 

But no one knows the proportions. Geologists 
assume that Earth contains the same amount 
of radioactive elements as certain primitive 
meteorites, but they aren't sure. “We're after 
trying to understand how Earth was built,’ says 
William McDonough, a geologist at the Univer- 
sity of Maryland in College Park. 

Enter KamLAND and Borexino, which 
spot geoneutrinos as a sideline to their other 
neutrino studies. Both experiments use 
liquid scintillator detectors, in which huge vats 
of fluid capture the occasional sparkle of light 
when a passing neutrino interacts with atomic 
nuclei in the liquid. 

The team at Borexino, a vat containing 
300 tonnes of liquid buried under the Italian 
Alps, captured 14 candidate geoneutrinos 
between December 2007 and August 2012 
(ref. 2). Scientists at KamLAND, with 
1,000 tonnes of liquid, say that they detected 
116 probable geoneutrinos between March 
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2002 and November 2012 (ref. 3). 

That’s just enough for researchers to start 
drawing conclusions about the composition 
of Earth’s mantle, says McDonough. Assum- 
ing that uranium and thorium are spread uni- 
formly in the mantle, the KamLAND findings 
suggest that about 11 of the 47 TW come from 
the radioactive decay of those elements. A simi- 
lar calculation for Borexino yields about 18 TW. 

Ultimately, geoneutrino researchers would 
like multiple detectors spaced around Earth, 
so that they could perform a sort of tomogra- 
phy on the mantle. That could help scientists 
to discern between models that favour the ura- 
nium and thorium being spread throughout 
the mantle, versus those in which the elements 
are concentrated near the core-mantle bound- 
ary. Such a difference could help to determine 
where and how long heat will continue to flow 
to drive geological processes such as plate tec- 
tonics — and how long it will take Earth to cool. 

One challenge is that emissions from ura- 
nium and thorium much nearer the surface in 
the continental crust can mask the geoneutrino 
signal coming from deeper in the planet (see 
‘Under the sea’). Next year, for example, the 
retrofitted Sudbury Neutrino Observatory 
(SNO) in Ontario, Canada, will start taking 
data with a 780-tonne detector that is sensitive 
to geoneutrinos. But SNO+, as the upgrade is 
called, sits smack in the middle of continental 
crust. Separating crustal from mantle geo- 
neutrinos is crucial, says Steve Dye, a physi- 
cist at Hawaii Pacific University in Honolulu, 
as “the mantle is really what contributes to the 
rate of cooling of the planet”. 

Dye and others say that the best way to 
catch mantle geoneutrinos would be from the 
ocean floor, where the crust is thinner than 
on land. One scheme, dubbed Hanohano, 
would lower a 10,000-tonne detector from 
a barge, and has been on the drawing board 
for years. Construction alone would cost 
some US$50 million to $60 million, says John 
Learned, a neutrino physicist at the Univer- 
sity of Hawaii at Manoa in Honolulu, and the 
technology is ambitious. 

“We've never done anything like this before,” 
he says. But interest in the project is growing, 
he adds, and supporters are trying to drum up 
funds to keep it moving. 

Meanwhile, China is working on its Daya 
Bay Il experiment, a 20,000-tonne detector on 
land that could be ready to hunt for geoneutri- 
nos in 2019. Borexino has funds to run for at 
least another four years. And KamLAND plans 
to keep going for at least five more years, says 
team member Hiroko Watanabe of Tohoku 
University in Sendai, Japan. Even after Japan's 
nuclear reactors restart, the detector will still be 
able to find geoneutrinos — just not as easily. m 


1. Araki, T. et al. Nature 436, 499-503 (2005). 

2. Bellini, G. et al. Preprint at http://arxiv.org/ 
abs/1303.2571 (2013). 

3. The KamLAND Collaboration. Preprint at http:// 
arxiv.org/abs/1303.4667 (2013). 
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Detective work uncovers 
under-reported overfishing 


Excessive catches by Chinese vessels threaten livelihoods and ecosystems in West Africa. 


BY CHRISTOPHER PALA 


one: China is under-reporting its overseas 

fishing catch by more than an order of 
magnitude, according to a study’ published on 
23 March. The problem is particularly acute in 
the rich fisheries of West Africa, where a lack of 
transparency in reporting is threatening efforts 
to evaluate the ecological health of the waters. 

“We cart assess the state of the oceans with- 
out knowing what’s being taken out of them,” 
says Daniel Pauly, a fisheries scientist at the 
University of British Columbia in Vancouver, 
Canada, who led the study. The unreported 
catch is crippling the artisanal fisheries that 
help to feed West African populations, he says. 

Fisheries experts have long suspected that 
the catches reported by China to the Food 
and Agriculture Organization of the United 
Nations (FAO) in Rome are too low. From 2000 
to 2011, the country reported an average over- 
seas catch of 368,000 tonnes a year. Yet China 
claims to have the world’s biggest distant-water 
fishing fleet, implying a much larger haul, says 
the study, which was funded by the European 
Union (EU). Pauly and his colleagues estimate 
that the average catch for 2000-11 was in fact 
4.6 million tonnes a year, more than 12 times 
the reported figure (see ‘A colossal catch’). Of 
that total, 2.9 million tonnes a year came from 
West Africa, one of the world’s most produc- 
tive fishing grounds. 

Liu Xiaobing, director of the division of 
international cooperation of China’s bureau 
of fisheries, put the yearly overseas catch at 
1.15 million tonnes in a speech to the EU last 
June. Pauly says that figure would be accurate 
if it referred to the amount brought back to 
China, rather than the total catch. Liu did not 
respond to an e-mailed request for comment. 

Fisheries scientists find the latest assessment 
startling. “So that’s where my fish were going!” 
says Didier Gascuel at the European University 
of Brittany in Rennes, France, who is a mem- 
ber of the scientific committee that advises 
Mauritania and the EU on fishing agreements. 
Year after year, Mauritanian populations 
of bottom-dwelling species such as octo- 
pus, grouper and sea bream have remained 
stubbornly low — a sign of overfishing by 
bottom-scraping trawlers, he says. “We had no 
idea the Chinese catch was so big and of course 


I tis a whopper ofa catch, in more ways than 
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A COLOSSAL CATCH 


Revised estimates for China's overseas fish catch show 
that it is taking a total of some 4.6 million tonnes per 
year, 12 times what is reported to the United Nations. 


Central and : 


South America 
182,000 tonnes 
(US$44 million) 


Antarctica 
48,000 tonnes — 
(US$7.8 million) 


we never included it our models,” he says. 

Fishing contracts between Chinese compa- 
nies and African nations are secret, so to esti- 
mate the catch, Pauly and his team had to do 
some sleuthing. The picture was further clouded 
because Chinese companies sometimes operate 
vessels flying local flags. So at least ten research- 
ers combined clues from field interviews, schol- 
arly articles and newspaper and online reports 
in 14 languages to estimate how many Chinese 
fishing vessels were operating in 93 countries 
and territories. They found many in nations 
where China reported no catch. The estimates 
were averaged to reach their conclusion: China 
had at least 900 ocean-going vessels, with 345 
in West Africa, including 256 bottom-trawlers. 

The scientists estimated the catch per coun- 
try on the basis of an assumed average catch 
for each type of vessel. “These numbers may 
not be absolutely exact, but they give the first 
hint of the magnitude of the problem,” says 
Boris Worm, a marine ecologist at Dalhousie 
University in Halifax, Canada, who was not 
involved in the study. 

Other experts are sceptical. “The new 
estimates seem far, far too high,” says Rich- 
ard Grainger, chief of the fisheries statistics 
and information service at the FAO. He notes 

that a previous assess- 


NATURE.COM ment’ estimated the total 
For more on unreported catch in West 
assessing fish Africa (by all nations) at 
stocks, see: 300,000-560,000 tonnes 
go.nature.com/9pjxin a year. That study tried 
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South Korea, Japan 
106,000 tonnes 
(US$22 million) 


Asia (excl. Japan, 
South Korea) 
948,000 tonnes 
(US$2.45 billion) 


: East Africa 


181,000 tonnes 
(US$50.5 million) 
Oceania 
198,000 tonnes 
(US$71.2 million) 


to identify what was missing from official 
catch figures with a review of English-language 
scientific studies. 

If the new numbers stand up, renewals of 
fishing contracts with West African nations 
could be affected. In the 2000s, under public 
pressure, EU fleets stopped fishing in coastal 
waters off much of West Africa, except Mau- 
ritania and Morocco. They were replaced by 
Chinese vessels, mostly large bottom-trawlers 
whose violations of near-shore no-fishing 
zones have led to protests. 

Gascuel, who helps to determine how many 
fish can be caught while avoiding population 
collapse, says that numbers of octopus and 
shrimp available to be taken in EU contracts 
with Mauritania, primarily by Spanish vessels, 
were already small. But once the actual Chi- 
nese catch is factored in, he says, “wed have to 
eliminate the Spanish catch”. 

Ironically, it was Pauly’s team that 12 years 
ago found that China had been over-reporting 
its domestic catch by at least 6 million tonnes. 
Pauly says that mid-level bureaucrats in the 
country often exaggerate their achievements’. 

But he says that China’s under-reporting of 
the distant-water catch is the more important 
problem. “It shows the extent of the looting of 
Africa, where so many people depend on sea- 
food for basic protein.” m 


1. Pauly, D. et al. Fish Fish. http://dx.doi.org/10.1111/ 
faf.12032 (2013). 

2. Agnew, D. J. et al. PLoS ONE 4, e4570 (2009). 

3. Watson, R. & Pauly, D. Nature 414, 534-536 (2001). 
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n March 2012, Joseph Polchinski began to contemplate suicide 
— at least in mathematical form. A string theorist at the Kavli 
Institute for Theoretical Physics in Santa Barbara, California, 
Polchinski was pondering what would happen to an astronaut 
who dived into a black hole. Obviously, he would die. But how? 
According to the then-accepted account, he wouldn't feel 
anything special at first, even when his fall took him through the 
black hole’s event horizon: the invisible boundary beyond which 
nothing can escape. But eventually — after hours, days or even 
weeks if the black hole was big enough — he would begin to notice 
that gravity was tugging at his feet more strongly than at his head. 
As his plunge carried him inexorably downwards, the difference 
in forces would quickly increase and rip him apart, before finally 
crushing his remnants into the black hole’s infinitely dense core. 
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But Polchinski’s calculations, carried out with two of his 
students — Ahmed Almheiri and James Sully — and fellow 
string theorist Donald Marolf at the University of California, 
Santa Barbara (UCSB), were telling a different story’. In their 
account, quantum effects would turn the event horizon into 
a seething maelstrom of particles. Anyone who fell into it 
would hit a wall of fire and be burned to a crisp in an instant. 

The team’s verdict, published in July 2012, shocked the 
physics community. Such firewalls would violate a foun- 
dational tenet of physics that was first articulated almost 
a century ago by Albert Einstein, who used it as the basis 
of general relativity, his theory of gravity. Known as the 
equivalence principle, it states in part that an observer 
falling in a gravitational field — even the powerful one 
inside a black hole — will see exactly the same phenom- 
enaas an observer floating in empty space. Without this 
principle, Einstein’s framework crumbles. 

Well aware of the implications of their claim, 
Polchinski and his co-authors offered an alternative 
plot ending in which a firewall does not form. But this 
solution came with a huge price. Physicists would have 
to sacrifice the other great pillar of their science: quan- 
tum mechanics, the theory governing the interactions 
between subatomic particles. 

The result has been a flurry of research papers about 
firewalls, all struggling to resolve the impasse, none succeed- 
ing to everyone's satisfaction. Steve Giddings, a quantum 
physicist at the UCSB, describes the situation as “a crisis in the 
foundations of physics that may needa revolution to resolve”. 

With that thought in mind, black-hole experts came 
together last month at CERN, Europe's particle-physics labo- 
ratory near Geneva, Switzerland, to grapple with the issue 
face to face. They hoped to reveal the path towards a unified 
theory of ‘quantum gravity’ that brings all the fundamen- 
tal forces of nature under one umbrella — a prize that has 
eluded physicists for decades. 

The firewall idea “shakes the foundations of what most 
of us believed about black holes”, said Raphael Bousso, a 
string theorist at the University of California, Berkeley, as he 
opened his talk at the meeting. “It essentially pits quantum 
mechanics against general relativity, without giving us any 
clues as to which direction to go next.” 


FIERY ORIGINS 

The roots of the firewall crisis go back to 1974, when 
physicist Stephen Hawking at the University of Cambridge, 
UK, showed that quantum effects cause black holes to run 
a temperature’. Left in isolation, the holes will slowly spew 
out thermal radiation — photons and other particles — and 
gradually lose mass until they evaporate away entirely (see 
“The information paradox’). 

These particles aren't the firewall, however; the subtleties 
of relativity guarantee that an astronaut falling through the 
event horizon will not notice this radiation. But Hawking’s 
result was still startling — not least because the equations of 
general relativity say that black holes can only swallow mass 
and grow, not evaporate. 

Hawking’s argument basically comes down to the 
observation that in the quantum realm, ‘empty’ space isn’t 
empty. Down at this sub-sub-microscopic level, it is in con- 
stant turmoil, with pairs of particles and their correspond- 
ing antiparticles continually popping into existence before 
rapidly recombining and vanishing. Only in very delicate 
laboratory experiments does this submicroscopic frenzy 
have any observable consequences. But when a particle-anti- 
particle pair appears just outside a black hole’s event horizon, 


Hawking realized, one member could fall in before the two 
recombined, leaving the surviving partner to fly outwards as 
radiation. The doomed particle would balance the positive 
energy of the outgoing particle by carrying negative energy 
inwards — something allowed by quantum rules. That nega- 
tive energy would then get subtracted from the black hole’s 
mass, causing the hole to shrink. 

Hawking’s original analysis has since been refined and 
extended by many researchers, and his conclusion is now 
accepted almost universally. But 


with it came 

the disturbing realization that 
black-hole radiation leads to a paradox that chal- 
lenges quantum theory. 

Quantum mechanics says that information cannot be 
destroyed. In principle, it should be possible to recover 
everything there is to know about the objects that fell ina 
black hole by measuring the quantum state of the radiation 
coming out. But Hawking showed that it was not that sim- 
ple: the radiation coming out is random. Toss in a kilogram 
of rock or a kilogram of computer chips and the result will 
be the same. Watch the black hole even until it dies, and 
there would still be no way to tell how it was formed or 
what fell in it. 

This problem, dubbed the black-hole information 
paradox, divided physicists into two camps. Some, like 
Hawking, argued that the information truly vanishes when 
the black hole dies. If that contradicted quantum laws, then 
better laws needed to be found. Others, like John Preskill, a 
quantum physicist at the California Institute of Technology 
in Pasadena, stuck by quantum mechanics. “For a time, I 
did seriously try to build an alternative theory that included 
information loss,” he says. “But I couldn't find one that made 
any sense — nobody could.” The stalemate continued for 
the next two decades, finding its most famous expression in 
1997, when Preskill publicly bet Hawking that information 
was not being lost, with the winner to receive an encyclo- 
paedia of his choice. 

But that same year, the deadlock was broken by a discovery 
made by Juan Maldacena, a physicist then at Harvard Univer- 
sity in Cambridge. Maldacena’s insight built on an earlier pro- 
posal that any three-dimensional (3D) region of our Universe 
can be described by information encoded on its two-dimen- 
sional (2D) boundary*”, in much the same way that laser 
light can encode a 3D scene ona 2D hologram. “We used the 
word ‘hologram’ as a metaphor,’ says Leonard Susskind, a 
string theorist at Stanford University in California, and one 
of those who came up with the proposal’. “But after doing 
more mathematics, it seemed to make literal sense that the 
Universe is a projection of information on the boundary.” 

What Maldacena came up with was a concrete mathematical 
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formulation’ of the hologram idea that made use of ideas from 
superstring theory, which posits that elementary particles are 
composed of tiny vibrating loops of energy. His model envis- 
ages a 3D universe containing strings and black holes that 
are governed only by gravity, bounded by a 2D surface on 
which elementary particles and fields obey ordinary quantum 
laws without gravity. Hypothetical residents of the 3D space 
would never see this boundary because it is infinitely far away. 
But that wouldn't matter: anything happening in the 3D uni- 
verse could be described equally well by equations in the 2D 
universe, and vice versa. “I found that there’s a mathematical 
dictionary that allows you to go back and forth between the 
languages of these two worlds,” Maldacena explains. 

This meant that even 3D black-hole evaporation could be 
described in the 2D world, where there is no gravity, where 
quantum laws reign supreme and where information can 
never be lost. And if information is preserved there, then it 
must also be preserved in the 3D world. Somehow, informa- 
tion must be escaping from the black holes. 


ONE FOR ALL 

A few years later, Marolf showed that every model of quan- 
tum gravity will obey the same rules, whether or not it is built 
from string theory’. “It was a combination of Maldacena and 
Marolf’s work that turned me around,” explains a long-term 
proponent of information loss, Ted Jacobson, a quantum 
physicist at the University of Maryland in Baltimore. In 2004, 
Hawking publicly admitted that he had been wrong, and gave 


“THE EVENT HORIZON IS 
LITERALLY A RING OF FIRE.” 


Preskill a base- 
ball encyclopaedia to make 
good on their bet. 

Such was the strength of Maldacena’s discovery that 
most physicists believed that the paradox had been settled — 
even though nobody had yet explained how Hawking radia- 
tion smuggles information out of the black hole. “I guess we 
just all assumed there would be a straightforward answer,’ 
says Polchinski. 

There wasn’t. When Polchinski and his team set 
themselves the task of clearing up that loose end in early 
2012, they soon stumbled on yet another paradox — the one 
that eventually led them to the fatal firewall. 

Hawking had shown that the quantum state of any one 
particle escaping from the black hole is random, so the par- 
ticle cannot be carrying any useful information. But in the 
mid-1990s, Susskind and others realized that information 
could be encoded in the quantum state of the radiation as a 
whole if the particles could somehow have their states ‘entan- 
gled’ — intertwined in such a way that measurements carried 
out on one will immediately influence its partner, no matter 
how far apart they are. 

But how could that be, wondered the Polchinski’s team? 
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For a particle to be emitted at all, it has to be entangled with 
the twin that is sacrificed to the black hole. And if Susskind 
and others were right, it also had to be entangled with all the 
Hawking radiation emitted before it. Yet a rigorous result of 
quantum mechanics dubbed ‘the monogamy of entangle- 
ment’ says that one quantum system cannot be fully entan- 
gled with two independent systems at once. 

To escape this paradox, Polchinski and his co-workers 
realized, one of the entanglement relationships had to be 
severed. Reluctant to abandon the one required to encode 
information in the Hawking radiation, they decided to 
snip the link binding an escaping Hawking particle to its 
infalling twin. But there was a cost. “It’s a violent pro- 
cess, like breaking the bonds of a molecule, and it releases 
energy, says Polchinski. The energy generated by sever- 
ing lots of twins would be enormous. “The event horizon 
would literally be a ring of fire that burns anyone falling 
through,” he says. And that, in turn, violates the equiva- 
lence principle and its assertion that free-fall should feel 
the same as floating in empty space — impossible when 
the former ends in incineration. So they posted a paper 
on the pre-print server, arXiv, presenting physicists with 
a stark choice: either accept that firewalls exist and that 
general relativity breaks down, or accept that information 
is lost in black holes and quantum mechanics is wrong’. 
“For us, firewalls seem like the least crazy option, given 
that choice,” says Marolf. 

The paper rocked the physics community. “It was 
outrageous to claim that giving up Einstein’s equivalence 
principle is the best option,’ says Jacobson. Bousso agrees, 

adding: “A firewall simply cam’t appear in empty space, 
any more than a brick wall can suddenly appear in an 
empty field and smack you in the face.” If Einstein’s 
theory doesn't apply at the event horizon, cosmolo- 
gists would have to question whether it fully applies 
anywhere. 
Polchinski admits that he thought they could have 
madea silly mistake. So he turned to Susskind, one 
of the fathers of holography, to find it. “My first 
reaction was that they were wrong,” says Susskind. 
He posted a paper stating as much’, before quickly 
retracting it, after further thought. “My second 
reaction was that they were right, my third was that they 
were wrong again, my fourth was that they were right,” he 
laughs. “It’s earned me the nickname, ‘the yo-yo, but my 
reaction is pretty much the same as most physicists”” 

Since then, more than 40 papers have been posted on the 
topic in arXiv, but as yet, nobody has found a flaw in the team’s 
logic. “It’s a really beautiful argument proving that there’s 
something inconsistent in our thinking about black holes,” 
says Don Page, a collaborator of Hawking’s during the 1970s 
who is now at the University of Alberta in Edmonton, Canada. 
A number of inventive solutions have been offered, however. 


REAL-WORLD IMPLICATIONS 

One of the most promising resolutions, according to Suss- 
kind, has come from Daniel Harlow, a quantum physicist at 
Princeton University in New Jersey, and Patrick Hayden, a 
computer scientist at McGill University in Montreal, Canada. 
They considered whether an astronaut could ever detect the 
paradox with a real-world measurement. To do so, he or she 
would first have to decode a significant portion of the outgo- 
ing Hawking radiation, then dive into the black hole to exam- 
ine the infalling particles. The pair’s calculations show that 
the radiation is so tough to decode that the black hole would 
evaporate before the astronaut was ready to jump in’. “There’s 
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When the hole evaporates, all the : A A} 
information disappears with it. PINLWALL 


The information is carried out by quantum 
correlations among all particles radiated 
from the black hole. 


Empty space is full of particle—antiparticle 
pairs that pop into existence due to quantum 


effects and are correlated with each other. Peglated Parties Die a e 


their correlations with 
their infalling partners. 


The pairs normally 
annihilate instantly. 


>< Black holé 


If a pair forms just 
outside a black hole’s 
event horizon, one 
particle may fall in 
while the other 
escapes as visible 
Hawking radiation. 


The energy that is 
released creates a 
firewall around the 
black hole. 


Each doomed particle 
carries negative energy 
inwards, so the black 
hole steadily loses mass. 
If no ordinary matter 
falls in, the hole will 
eventually evaporate. 
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Correlations between 
the emitted particles 
contain information 
about everything that 
fell into the black hole, 
even after the hole 
evaporates. 


The ‘singularity’ at the black hole’s centre is infinitesimally 
small, infinitely dense and contains no information about 
q the matter that formed the black hole. 


no fundamental law preventing someone from measuring 
the paradox,’ says Harlow. “But in practice, it’s impossible.” 

Giddings, however, argues that the firewall paradox 
requires a radical solution. He has calculated that if the 
entanglement between the outgoing Hawking radiation and 
its infalling twin is not broken until the escaping particle has 
travelled a short distance away from the event horizon, then 
the energy released would be much less ferocious, and no 
firewall would be generated”’. This protects the equivalence 
principle, but requires some quantum laws to be modified. 
At the CERN meeting, participants were tantalized by the 
possibility that Giddings’ model could be tested: it predicts 
that when two black holes merge, they may produce distinc- 
tive ripples in space-time that can be detected by gravita- 
tional-wave observatories on Earth. 

There is another option that would save the equivalence 
principle, but it is so controversial that few dare to champion 
it: maybe Hawking was right all those years ago and informa- 
tion is lost in black holes. Ironically, it is Preskill, the man 
who bet against Hawking’s claim, who raised this alterna- 
tive, at a workshop on firewalls at Stanford at the end of last 
year. “It’s surprising that people are not seriously thinking 
about this possibility because it doesn’t seem any crazier than 
firewalls,” he says — although he adds that his instinct is still 
that information survives. 

The reluctance to revisit Hawking’s old argument is a sign 
of the immense respect that physicists have for Maldace- 
na’s dictionary relating gravity to quantum theory, which 
seemingly proved that information cannot be lost. “This is 
the deepest ever insight into gravity because it links it to 
quantum fields,” says Polchinski, who compares Maldacena’s 


result — which has now accumulated close to 9,000 citations 
— to the nineteenth-century discovery that a single theory 
connects light, electricity and magnetism. “If the firewall 
argument had been made in the early 1990s, I think it would 
have been a powerful argument for information loss,’ says 
Bousso. “But now nobody wants to entertain the possibility 
that Maldacena is wrong.” 

Maldacena is flattered that most physicists would back 
him in a straight-out fight against Einstein, although he 
believes it won't come to that. “To completely understand 
the firewall paradox, we may need to flesh out that diction- 
ary, he says, “but we won't need to throw it out” 

The only consensus so far is that this problem will not 
go away any time soon. During his talk, Polchinski fielded 
all proposed strategies for mitigating the firewall, carefully 
highlighting what he sees as their weaknesses. “I’m sorry 
that no one has gotten rid of the firewall,” he concludes. “But 
please keep trying” = 


Zeeya Merali is a freelance writer based in London. 
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A Russian defence 
unit practises rescue 
training after a 
simulated chemical 
attack in 2002. 


Update the Chemical 
Weapons Convention 


Bring biological threats into the treaty and make chemists more 
aware of the dark side of their research, says Leiv K. Sydnes. 


188 nations that signed and ratified 
the Chemical Weapons Convention 
(CWC) will meet in The Hague, the Nether- 
lands, to review the treaty for the third time. 
The documents at the heart of the discus- 
sions bear good news about progress against 
the threat of chemical warfare. But they do 
not address recent geopolitical changes or 
emerging small-scale production technolo- 
gies that pose new risks. 
Analyses from the past five years will show 
that the Organisation for the Prohibition of 


FE rom 8-19 April, representatives of the 


Chemical Weapons (OPCW) has done an 
outstanding job. About 78% of the declared 
chemical-weapons stockpile has been 
destroyed, and this is expected to rise to 99% 
by 2017. The reports will also show that all 
known chemical-weapons production facili- 
ties have been deactivated or converted for 
peaceful purposes. And they will state that 
scientific and technological developments 
have been evolutionary, not dramatic, with 
little impact on the production and use of 
chemical weapons. 

But what will be missing is a crisp 
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assessment of how major geopolitical 
changes have altered the world that the 
CWC and OPCW were created to serve. 
The CWC is an agreement between states, 
as these were the political players of the past. 
But major conflicts today involve a few coun- 
tries that have not signed the convention, 
and some small, militant groups with politi- 
cal objectives but no legal standing. Syria, 
for example — one of six countries that has 
neither signed nor acceded to the CWC — is 
said to have chemical weapons. This under- 
mines the authority of the convention. > 
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> New chemical processing 
techniques and equipment, 
including compact reactors, 
mean that terrorists can make 
small but deadly quantities of 
chemical weapons beyond the 
reach of the convention. Toxic 
compounds can also be synthe- 
sized using biological processes, 
which are not controlled. 
These facts call for two sci- 
ence-based actions: inspection 
of biotech industries, and thor- 
ough analysis of data showing 
sales of chemicals and reactor 
technology. To achieve this, we 
should rethink the CWC and 
merge it with the Biological 
and Toxin Weapons Conven- 
tion (BWC). An updated con- 
vention needs a broader remit if 
it is to limit the re-emergence of 
chemical weapons in any form 
and any amount, anywhere. 


Industrial plants in Syria are outside the remit of the Chemical Weapons Convention. 


LIMITED SCOPE 

Conferences take place every five years to 
evaluate the operation of the CWC treaty, 
assess progress and decide whether modifi- 
cations are needed. So far, few changes have 
been made. The signatory countries have 
been concerned with chemical warfare on 
a massive scale, such as the Iran-Iraq war 
(1980-88). The regulations and inspection 
regimes focus on certain types of chemicals 
owned by states in large quantities — both 
chemical weapons, suchas sarin and mustard 
gas, and the chemicals, including phosgene 
(carbonyl dichloride) and thionyl chloride, 
that are used to prepare them. 

Geopolitical changes affect the CWC 
because it deals only with national authori- 
ties and not with the powerful individuals, 
revolutionary groups, factions in civil wars 
and terrorist cells that can exert a detrimental 
influence on global security. The number of 
non-state political actors has grown as a result 
of regional conflicts, increasing terrorism and 
the collapse of national structures. The CWC 
does not give the OPCW any authority to act 
in conflicts between such parties. This does 
not prevent the destruction of existing chemi- 
cal weapons, but it could pose serious prob- 
lems in the future. 

The small-scale production of chemical 
weapons by activists and terrorists is also 
beyond the scope of the CWC, and these risks 
are growing. Advances in three areas must be 
watched carefully. 

First, closer surveillance is needed of 
chemistry that overlaps with medicine and 
biology, such as research on neuropeptides 
and bioregulators — an area that is grow- 
ing rapidly and paves the way for synthetic 
biology and the large-scale production of 
toxins. Examples are saxitoxin (a paralytic 
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neurotoxin) and ricin (an inhibitor of pro- 
tein production), which are prohibited by 
both the BWC and the CWC. It therefore 
makes sense to merge the two conventions 
and introduce the CWC inspection regime 
to the biological sciences. This will have a 
significant impact because the BWC does 
not authorize inspections. 

A second area for scrutiny is research on 
incapacitating chemical agents (ICAs) — 
compounds that act on the central nervous 
system but are meant for crowd control, rather 
than warfare’. Such work is currently allowed 
under the CWC, but I agree with interna- 
tional security expert Malcolm Dando of the 
University of Bradford, UK, that it should be 
prohibited”. Compounds such as fentanyl 
derivatives and other opioids are deadly to 
some people but not to others, depending on 
gender, age and general health. When some 
750 hostages were exposed to a fentanyl 
derivative in a Moscow theatre in 2002, for 
example, about 125 people died. Other deadly 
ICA compounds will inevitably be produced 
in the search for a safe agent for riot control. 

Third, flow microreactors must be 
tracked because they make it relatively safe 
to produce toxic compounds, as only small 
amounts of products are present in the reac- 
tor at any given time. Microreactors have 
become widespread because they are robust 
and easy to use, making it quick and easy to 
set up and dismantle a small chemical-weap- 
ons facility. The OPCW should consider 
holding a register of microreactor buyers. 

Information and people are both spread 
more widely than five years ago. The scien- 
tific literature is accessible anywhere with ade- 
quate computer facilities, and can be searched 
quickly and thoroughly. With procedures and 
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starting materials so readily 
available, almost anyone can try 
to make a chemical weapon. 
Many students are educated 
in several different countries 
and become involved in mul- 
tinational research projects. 
Researchers travel to lecture, 
to discuss and build networks, 
and to participate in virtual 
education. This mobility 
increases the likelihood that 
individuals with a university 
education in chemistry will be 
exposed to political ideas that 
lead them to commit terrorist 
acts. Indeed, at least one of the 
terrorists behind the London 
bombing in 2005 had some 
education in chemistry. 


CODE OF CONDUCT 

Chemistry cannot be blamed 
for chemical warfare or terror- 
ism, but the chemistry com- 
munity has a duty to be aware 
of the danger and to act to prevent the mis- 
use of chemicals. We must work to educate 
people about chemical safety, waste disposal 
and the responsible use of chemicals, start- 
ing at school. We need university courses on 
chemical weapons, and I welcome initiatives 
by the OPCW and several non-governmental 
organizations, including the International 
Union of Pure and Applied Chemistry, to 
provide materials for these. 

Awareness also has an ethical dimension. 
The OPCW is among those who have tried 
to compose a code of conduct for chemists, 
but so far no document has emerged. Some 
say that the civilian and military dual use 
of chemicals makes it almost impossible to 
agree on a short and precise wording. Others 
argue that signing a code means nothing if 
you are inclined to violate the CWC. 

I would like to see a code of conduct. I 
think that signing up to such a code would 
force chemists to reflect on the gravity of 
their work, increasing their levels of respon- 
sibility and awareness. This, a strong OPCW 
and a new Biological and Chemical Weapons 
Convention are the best ways to prevent the 
use of these terrible weapons. m 
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DIGITALGLOBE VIA 


Link the world’s best investigators 


Nurturing small groups of leading researchers — especially young scientists — is the 
way to break intellectual ground, says Alan Bernstein. 


en the molecular-biology 
revolution began in the 1950s, 
biochemist Erwin Chargaff 


observed that the physicists, chemists and 
geneticists involved were “practising biochem- 
istry without a licence”. He didn’t recognize 
that this small group of young innovators was 
forging an entirely new approach to biology. 

Many challenges facing humanity, from 
chronic disease to climate change, demand 
a multidisciplinary perspective and fresh 
thinking — as well as a global approach. Yet 
few seasoned researchers venture beyond 
their scientific fields, campuses or national 
funding programmes. The obstacles are even 
greater for young scientists, whose creativity 
is constrained within large teams dominated 
by senior investigators, and whose prospects 
for funding and positions are dwindling. 

As I approach the end of my first year 
as president of the Canadian Institute for 
Advanced Research (CIFAR), I have been 
reflecting on how best to support trans- 
formative advances in science. As globali- 
zation and the pace and scale of research 
increase, different sorts of institutions and 
funding models are needed. As a virtual 
global network, CIFAR offers one vision for 
supporting exchanges across borders and 
disciplines. But more needs to be done to 
enable talented young scientists to pursue 
original, multidisciplinary work. 


GLOBAL COLLABORATION 

For three decades, CIFAR has brought 
together small groups of outstanding 
researchers from varied fields to focus on 
important scientific questions. Rather than 
replicating the university campus, as other 
advanced institutes do, CIFAR functions as 
a virtual academy. From its headquarters 
in Toronto, CIFAR’s 12 networks — which 
cover topics from human health and quan- 
tum materials to economics and sustainability 
— link almost 400 fellows and advisers from 
103 institutions in 16 countries (see ‘Four 
themes’). Landmark studies so far include 
Why Are Some People Healthy and Others 
Not?! and Why Nations Fail’. 

CIFAR is not a granting agency, but 
convenes meetings for its global pro- 
grammes, typically twice a year. With an 
operating budget in 2013 of Can$17 million 
(US$17 million), CIFAR is funded by federal 
and provincial Canadian governments, indi- 
viduals and the private sector. 

The institute’s recipe for success has three 
key ingredients: framing the right question, 


FOUR THEMES 


The Canadian Institute for Advanced Research 
funds interdisciplinary researchers in four areas. 


Building Improving 
strong societies human health 
$3.91 m (23%) $2.04 m (12%) 


2011-12 
budget 


USSI7 


million 


Sustaining Transforming 
Earth technology 
$5.95 m (35%) $5.1 m (30%) 


identifying the right researchers and leader, 
and supporting a network for long enough to 
overcome disciplinary and cultural barriers. 

The compelling questions that interest 
CIFAR should presage a new field of enquiry, 
require multiple perspectives to answer and 
have enough appeal to bond a diverse group. 
The question of how early social experiences 
affect an individual's life-long physical and 
mental health and well-being, for example, is 
the focus of our Child & Brain Development 
programme’. This year, CIFAR will launch 
its first Global Call for Ideas, inviting the 
Canadian and world research community to 
propose new questions on global challenges. 

Each programme is overseen by an exter- 
nal advisory committee of 3-5 distinguished 
investigators who help to select members and 
direct the research. Programmes are reviewed 
every 5 years by an external panel on the basis 
of scientific excellence, synergy and impact. 
Programmes may be terminated, extended 
for a further 5 years, or renewed with differ- 
ent leadership or in a new direction. 

The choice of members is crucial. Scien- 
tific excellence is a prerequisite, and ‘talent 
attracts talent: The programmes disciplinary 
and geographic diversity make it essential that 
members have exemplary communication 
skills. Leadership also matters, so programme 
directors must have high standing and experi- 
ence of running their own groups. To develop 
novel ideas, they also need listening skills and 
the ability to synthesize viewpoints. 

It takes time to find common ground, so 
CIFAR typically supports programmes for 
10-15 years. Deep dialogue requires trust, 
which is built through face-to-face exchanges. 
The use of small groups (15-20 people) 
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retains intimacy while encouraging robust 
discussions and risk taking. 

There are no requirements for how much 
time each fellow must devote to the pro- 
gramme; collaborations arise naturally. Most 
fellows say that their involvement changes 
the long-term direction of their research. 

Some might argue that the hand-picking 
of fellows is elitist, but I believe that it pro- 
vides an opportunity for the best to succeed. 
On the advice of the programme director 
and the advisory committee, fellows are 
approved by the president, with appoint- 
ments monitored closely by CIFAR’s board 
of directors to avoid bias. Nevertheless, the 
involvement of one group still needs to be 
strengthened: that of young investigators. 


THE NEXT GENERATION 

Young investigators are crucial to the future 
of research, especially at a time of profound 
technological change. Computational biol- 
ogy, for example, is largely driven by emerging 
scientists. But young scientists face many hur- 
dles, from low grant success rates to a dearth 
of academic positions and the difficulty of 
developing autonomy within large teams. 
Those in the developing world are further 
hindered by scarce resources, remoteness, 
and heavy teaching and administrative loads’. 

Although CIFAR has brought 24 out- 
standing young researchers into its existing 
programmes, we recognize that this is not 
enough. In February, we launched the CIFAR 
Global Academy, an initiative that will help 
emerging researchers to develop their 
careers. It will include visits and exchanges, 
meetings and programmes at the interface 
between research, policy and practice. 

We need to increase the world’s capacity to 
collaborate and to nurture the next genera- 
tion of research leaders. We hope that other 
organizations will work with us to explore 
new partnerships and models of interaction 
across disciplines, nations and generations. = 


Alan Bernstein is president of the 
Canadian Institute for Advanced Research, 
Toronto, Ontario M5G 1Z8, Canada. 
e-mail: abernstein@cifar.ca 
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Waura tribesmen fish in the Piulaga lake near their village in the Upper Xingu region of Brazil’s Mato Grosso state. 


ANTHROPOLOGY 


A monochrome Eden 


Bob Bloomfield assesses a haunting photographic record of remote environments 
and the indigenous peoples who live in them. 


r | Vhe celebrated Brazilian documentary 
photographer Sebastiao Salgado is 
best known for his black-and-white 

images of the Serra Pelada gold mine in Bra- 

zil, shot in 1986. Those stark photographs 
of mud-encrusted workers toiling through 
the labyrinthine open mine workings high- 
lighted the appalling working conditions. 

Now, after decades of social observation, 

Salgado has turned his lens on the environ- 

ment, to convey how its degradation has led 

to much of the inequity he has recorded. 
Genesis is a project eight years in the mak- 
ing — the culmination of journeys across 

32 countries to document remote ecosystems 

and local peoples whose survival now hangs 

bya thread. The 250 photographs on show at 

London's Natural History Museum are part 

of a portfolio that is touring venues around 

the world. Salgado calls his new work Genesis 
because he wanted to record “the animal spe- 
cies that have resisted domestication” and the 

“remote tribes whose ‘primitive’ way of life is 

largely untouched”. He hopes that by high- 

lighting this “uncontaminated world” he can 
help it to be preserved and, where degraded, 
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restored. His visionis Sebastido 

that development need Salgado: Genesis 

not be synonymous WATERHOUSE 

with destruction. Se NA 
The images include "1 O°Y MSFN 

eames : LONDON 
breathtaking vistas 17 apritio 
across Siberian wastes, 8 September 2013 


in verdant Amazonian 

forested valleys, over the haunting Namibian 
dune fields and among the animals at home 
in the harsh beauty of Antarctica. He spent 
time with local peoples, including the Xingu 
in the upper Amazon, the Bushmen in Bot- 
swana and the Dinkas in Sudan. There are 
striking records of daily life: foraging, mak- 
ing fires, preparing food, socializing and 
sharing ritual celebrations. 

Salgado provides no contrasting images 
of the globalized world, dominated by urban 
expansion and the conversion of natural land 
for burgeoning infrastructure, transportation, 
extraction and agro-forestry. Nor, in hymn- 
ing the intrinsic beauty of wilderness, does he 
draw attention to those regions’ wider value 
as vital ‘organs of the biosphere, contribut- 
ing to the planet's regulatory system. There is 
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instead a nostalgia for a halcyon disappear- 
ing world — worthy and understandable, but 
saying little about the broader imperative to 
protect global wilderness and biodiversity. 
Similar issues cloud the portraits. Salgado 
spent time with the people he recorded, and 
the images were taken with their consent. 
Yet his photographs of proud people seem 
to be coloured by the Arcadian ideals of the 
late eighteenth century — a vision of ‘noble 
savages’ populating an Eden somehow pre- 
served from the ravages of civilization. In 
part because of the likeness of the black- 
and-white images to old sepia photographs, 
I found myself reminded of colonial and 
ethnographic photographic archives. 
Today, our understanding of indigenous 
peoples is more nuanced. Even though they 
may be profoundly culturally adapted to 
their environments, their ‘sustainable life- 
styles often come at a price, such as high 
child mortality or relatively short lifespans. 
And whereas many indigenous cultures 
are seriously threatened by outside forces, 
many are increasingly proactive in fighting 
for their traditional and customary rights 
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to the land, and are adopting new ideas and 
technologies on their own terms. 

For example, in Brazil, the Kaxinawa live 
in the state of Acre in the western Amazon, 
where forest covers 88% of the land but 
indigenous groups control only 1.5% of it. 
Kaxinawa elders are working with the state 
government to ensure official recognition 
of and compensation for their traditional 
stewardship of the forests. In Ecuador, the 
Huaorani, whose lands are under threat 
from logging and oil interests, are using 
geographic information system technology 
and traditional knowledge to map areas for 
potential tourism and conservation, creat- 
ing a community resource for planning and 
negotiating. There is nothing of this side of 
the story in Genesis. 

Salgado spends time on the Atlantic coast 
of Brazil, home to one of the most biodiverse 
forest ecosystems in the world, the highly 
threatened Atlantic Forest (Mata Atlan- 
tica), of which only small fragments survive. 
With his wife, Lélia Deluiz Wanick, Salgado 
has been restoring a denuded 630-hectare 
site near Aimorés by planting trees and 
reintroducing other species. They have also 
founded the Instituto Terra, an initiative 
that supports the environmental education 
of Brazilian officials, teachers and farmers, 
and aims to show that it is possible to restore 
environments that seemed lost forever. 

Although Genesis brings into sharp focus 
the need to preserve both the wilderness 
and the well-being of the peoples relying 
on it, the role of the outside world is more 
ambiguous. Both the Atlantic Forest pro- 
ject and the exhibition are supported by the 
Brazilian multinational Vale, based in Rio de 
Janeiro, one of the biggest construction and 
mining companies in the world. Its website 
states that “it is committed to quality of life 
and environmental preservation”. However, 
some indigenous peoples and human-rights 
groups oppose Vale's stakes in major dam 
and mining projects in the Amazon interior. 

If Genesis gets people to reconsider the 
value of the disappearing wild, it will have 
achieved an important aim. If it promotes dis- 
cussion on the plight of indigenous peoples, 
their self-determination and their potential as 
stewards of wilderness crucial to our future, it 
willhave offered an invaluable revelation. But 
however visually stunning and moving these 
images are, to me they reduce the complex 
realities for local people and remote environ- 
ments of our changing world to black and 
white, rather than shades of grey. m 


Bob Bloomfield is an independent expert, 
based in the United Kingdom, on public 
engagement and understanding of science 
and environment, and formerly in the Public 
Engagement Executive of London’ Natural 
History Museum. 

e-mail: bob_bloomfield@ bobania.f9.co.uk 


Books in brief 


Napalm: An American Biography 

Robert M. Neer HARVARD UNIVERSITY PRESS 352 pp. $29.95 (2013) 

In 1942, in a secret lab at Harvard University in Massachusetts, 
chemist Louis Fieser and his team created napalm — an incendiary 
gel that sticks to skin and can burn down to the bone. Robert Neer’s 
harrowing story veers from Fieser’s tests on ‘kamikaze’ bats fitted with 
napalm bombs to the 1944-45 incendiary bombing of Japan that 
killed 330,000 people. In 1980, the United Nations declared the gel’s 
use on concentrations of civilians a war crime. An interview with Phan 
Thi Kim Phuc — photographed running naked and napalm-burned 
during the Vietnam War — underlines the cost of miscast innovation. 


Invisible in the Storm: The Role of Mathematics in Understanding 
Weather 

lan Roulstone and John Norbury PRINCETON UNIVERSITY PRESS 346 pp. 
$35 (2013) 

Mathematicians lan Roulstone and John Norbury demystify the 
maths behind meteorology. Trailblazers’ work is vividly evoked, from 
eighteenth-century mathematician Leonhard Euler on hydrostatics to 
physicist Vilhelm Bjerknes’s numerical weather prediction. The pace 
cranks up with twentieth-century advances such as Jule Gregory 
Charney’s harnessing of the gargantuan ENIAC computer for his 
work in the 1940s and 1950s on forecasting pressure patterns. 


Gold Rush in the Jungle: The Race to Discover and Defend the 
Rarest Animals of Vietnam’s “Lost World” 

Dan Drollette Jr CROWN 336 pp. $25 (2013) 

From elusive forest oxen to barking deer, Vietnam has seen a raft of 
newly discovered species emerge during the past decade. Almost as 
swiftly, the black market is bringing many close to extinction. Science 
journalist Dan Drollette Jr reveals the courage and ingenuity of 
researchers intent on preserving what wildlife they can. Drawing on 
years of visits to the country, his snapshots of these wildlife warriors 
—such as langur specialist Tilo Nadler — reveal approaches that 
could show the way for conservationists in other tight spots. 


Project Sunshine: How Science Can Use the Sun to Fuel and Feed 
the World 

Tony Ryan and Steve McKevitt ICON 320 pp. £16.99 (2013) 
Polymer chemist Tony Ryan and consumerism expert Steve 
McKevitt train a ray of sunshine on the issues of food and energy 
security. They argue — and are backed by a major research 
project at the University of Sheffield, UK — that a “solar economy” 
can sustainably support the world population of 9 billion people 
predicted for 2050. Cogently analysing business as usual won't 
work; the authors lay out a science-based, multi-pronged solution 
incorporating measures such as state-of-the-art photovoltaics. 


Bug Music: How Insects Gave Us Rhythm and Noise 
David Rothenberg ST MARTIN’S PRESS 288 pp. $26.99 (2013) 
Philosopher and musician David Rothenberg asks whether the 
apping, clicking, humming leitmotifs of the insect world inspired 
aay ft human rhythm. The result mixes research on insect behaviour and 
vous. anatomy, sonograms and more with apt artistic digressions. From 
say he deafening ‘surround-sound’ of the 17-year cicada Magicicada 
cassini to syncing crickets, vibrating three-humped treehoppers and 
he “penile music” of water boatmen, this is an enchanting foray into 
he “polyrhythmic swirls of the entomological soundscape”. 
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| COMMENT | BOOKS & ARTS 


PALAEONTOLOGY 


Giants unearthed 


Xu Xing revels in an enthusiast’s tour of the Mesozoic era and its denizens. 


rian Switek is a science writer with 
B a deep understanding and broad 

knowledge of evolution. His books 
offer a compelling mixture of reliable infor- 
mation, personal experiences and thoughts, 
anecdotes about palaeontological research 
and even science philosophy, delivered in a 
breezy and engaging style. 

In My Beloved Brontosaurus, Switek 
— a fan of dinosaurs since childhood — 
describes his tours of famous excavation 
sites, selected museums and high-tech 
research laboratories across the United 
States. He meets and talks to a host of 
palaeontologists, including fossil-bone 
expert Mark Goodwin at the University 
of California Museum of Paleontology at 
Berkeley. By getting close to the people 
and the science that they move forward, 
Switek paints a comprehensive picture of 
how our understanding of dinosaur evo- 
lution gradually advances. Along the way, 
he shows how dinosaur palaeontology is a 
cultural touchstone in the United States and 
many other countries. 

Switek touches on widely discussed topics, 
such as how dinosaurs rose to dominance 
early in the Mesozoic era (250 million to 
65 million years ago) and why all of them — 
except the strange feathered ones we know 
as birds — died out when the era ended. But 
his focus is on how dinosaurs lived. 

Switek picks through recently discovered 
fossils such as the carnivorous feathered 
dinosaur Sciurumimus albersdoerferi found 
in Germany in 2011. And he discusses cut- 
ting-edge methods such as the use of com- 
puted tomography scanning to investigate 
fossils’ nasal passages, brain cavities and other 
internal cranial features. A recurring theme 
of the book is that new specimens, technolo- 
gies and thinking are 
making it possible to 
investigate how dino- 
saurs mated, matured 
and became gigantic; 
to study their social life 
and the diseases that 
afflicted some indi- 
viduals; and even, as 


in the case of the small, Vly Beloved 
bird-like Anchiornis, Brontosaurus: 
On the Road with 


to reconstruct aspects 
of their coloration. 
Some of Switek’s 
best palaeontologi- 
cal stories come from 
the history of the 


Old Bones, New 
Science, and Our 
Favorite Dinosaurs 
BRIAN SWITEK 
Scientific American: 
2013. 272 pp. $26 
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Sculptor Brian Cooley’s reconstruction of a 
Tyrannosaurus rex. 


discipline. He explains how his favourite 
dinosaur when he was a child was popularly 
known as brontosaurus, but had in fact been 
known scientifically as Apatosaurus since 
1903, when it was reclassified as a species of 
the latter genus. Eventually, the new name 
filtered through to museum exhibits and 
popular books, and although brontosaurus 
may not have disappeared entirely, the cur- 
rent generation of dinosaur-savvy children 
are aware of its obsolescence. 

More often, Switek focuses on recent 
advances. The rediscovery of an animal 
called Effigia in an old collection of fossils at 
the American Museum of Natural History 
in New York helped researchers to show that 
some early relatives of crocodilians were sim- 
ilar to many dinosaurs — not only did they 
have a near-vertical rather than sprawling 
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hindlimb posture, but they were also bipedal. 
A reluctant field decision to break the thigh 
bone of a Tyrannosaurus rex for ease of trans- 
port exposed a type of bone tissue seen today 
in female birds that are about to lay eggs, 
revealing the dinosaur’s gender. And a grad- 
uate student’s thesis research on the ink sacs 
of a fossil squid led to a method for partially 
reconstructing the colours of some extinct 
animals, including dinosaurs. 

Ihave a few quibbles. In some instances, 
Switek recounts a conversation he had 
with a scientist to lend authority to an 
account of a discovery, but the details on 
how inferences were made and hypoth- 
eses tested are left out. The phylogeny- 
based definitions given for some animal 
groups are slightly inaccurate, and there 

are occasional lapses in the reliability of 

the scientific storytelling. For example, 
in discussing the importance of Effigia, 
Switek glosses over the fact that near-vertical 
hindlimb posture (although not bipedality) 
was already known to be widespread among 
early members of the crocodilian lineage. 

The book largely revolves around the 
dinosaur stars from the early days of North 
American palaeontology, and space for 
recent discoveries made by researchers else- 
where is limited. Similarly, some fascinating 
new research directions are discussed briefly 
or not at all. 

Switek does not mention the 34 juve- 
niles of the small, parrot-beaked dinosaur 
Psittacosaurus that were found preserved 
with an adult in Chinese strata — a fossil 
that is among the best evidence to support 
the existence of parenting in dinosaurs. He 
also omits the use of isotopic analysis to 
reconstruct the lifestyle of the mysterious 
spinosaurs — massive carnivorous dino- 
saurs with vaguely crocodile-like heads. Nor 
is there anything about molecular studies 
on the development of feathers, fingers and 
toes in modern taxa, which have contributed 
greatly to our understanding of the morpho- 
logical evolution of birds. 

Of course, not even a brontosaurus-sized 
book could include everything. Switek has 
succeeded in covering a wide range of inter- 
esting topics in dinosaur palaeontology with 
infectious enthusiasm. = 


Xu Xing is professor in the Institute 

of Vertebrate Paleontology and 
Paleoanthropology of the Chinese Academy 
of Sciences in Beijing. 

e-mail: xu.xing@ivpp.ac.cn 


REUTERS 


Correspondence 


Sexism: dearth of 
female role models 


Although “science remains 
institutionally sexist”(Nature 
495, 21; 2013), gender inequality 
does not operate on its own. It is 
mediated by other inequalities, 
such as social class and race, 
which intersect. 

Of the four researchers you 
use to illustrate the contribution 
of young, driven women to the 
scientific landscape (Nature 
495, 28-31; 2013), at least two 
come from families in which 
both parents were scientists. 
This could well have habituated 
them in scientific practice 
from an early age. The reality is 
that most female scientists-in- 
training struggle to find female 
role models to smooth their 
transition into an academic 
career. 

This effect is particularly 
isolating for women of African 
or Caribbean origin, who 
are among the most under- 
represented groups in the 
sciences. 

We call for rigorous qualitative 
research into the interaction 
of all these factors so that the 
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A sample of responses 
from the debate on 
closing the gender 
gap in science 
(Nature 495, issue 
7439; 2013). 


occupational field can be cleared 
of cultural disadvantages for 
aspiring women and minority 
scientists. 

Simon Williams, Christine 
Wood, Richard McGee Feinberg 
School of Medicine, Northwestern 
University, Chicago, Illinois, USA. 
simonwilliams@northwestern.edu 


Sexism: a revealing 
experiment 


Few of the depressing statistics 
on women in science pinpoint 
hard evidence for bias against 
individuals (see, for example, 
Nature 495, 22-24; 2013). SoI 
conducted a small experiment of 
my own. 

We know that successful 
grant applications are important 
drivers of promotion and tenure. 
In my first year as a researcher 
in 2005, I submitted 16 grant 
applications under my full name, 
which is not gender neutral. Just 
one received funding. 

In my second year, I applied 
using only my initials and my 
last name: my success rate went 
up fivefold. This was only an 
n= 1 experiment, but I didn’t 


James Dwyer says: 

Simply quantifying inequities 
does not provide indisputable 
evidence of discrimination. 

A complete quantitative 
analysis would require that pay 
differences be normalized by 
some reliable measure of job 
performance — of value to the 
employer. Likewise, the number 
and value of research grants 
awarded should be normalized 
by not just the number of 
applications, but ideally by 
some independent measure of 
their quality. 


Julia Piaskowski says: 

The ‘30-something’ women 
you describe are astounding 

in their abilities to keep their 
careers spinning, but they set a 
very high bar. This is unrealistic 
for the average female scientist 


care to repeat it. 

In my third year, the university 
adopted an electronic grant- 
application system, which, 
unbeknown to me, automatically 
entered my full first name. In 
this blinded study, my success 
rate went down fivefold again, 
coincident with changing 
back to a female name on the 
application. 

The following year, I asked 
the university to modify my 
entry to use only initials for my 
first names, thereby frustrating 
the automated system. My 
success rate went back up 
fivefold. 

Iam the same applicant. 

The replicates are low, but the 
outcome apparently differed 
only when it was obvious to the 
reviewers that I was female. 
Tina M. Iverson Vanderbilt 
University Medical Center, 
Nashville, Tennessee, USA. 
tina.iverson@vanderbilt.edu 


Sexism: science 
biographer responds 


In criticizing the stereotyping 
of female scientists, historian 


and may have profound 
consequences for child-rearing. 


Helen Knob says: 

As an associate professor at a 
high-ranking university and the 
mother of two young children, | 
see promising female graduate 
students and postdocs take 

off in fright from academia: 
they see what | am doing and 
think itis too hard. But running 
any enterprise with huge 
responsibilities as a full-time 
working mother is always 
going to be challenging, not 
least because ‘full time’ usually 
means almost every waking 
minute. 


Vivian Zapf says: 

When my children arrived, | 
switched to part-time working. 
| thought that my career would 


Patricia Fara creates a new 
stereotype — for biographies 
of female scientists (Nature 
495, 43-44; 2013). She laments 
that they portray their subjects 
as weird, and protests against 
catchy titles, elements of cover 
design and the use of first 
names rather than surnames. 
As the author of one of the 
five books Fara criticizes, I 
would have welcomed a more 
substantive discussion. The 
challenges of writing scientific 
biography are more complex 
than she implies, whether the 
subject is female or male. 

My book is, among other 
things, an exploration of the 
intertwined roles of truth and 
beauty in science, and of how 
a mathematician (Dorothy 
Wrinch) and a chemist (Linus 
Pauling) came to see them very 
differently. That difference is 
reflected in the title I Died for 
Beauty: Dorothy Wrinch and 
the Cultures of Science. I fail to 
see how, as Fara contends, this 
trivializes Wrinch’s intellect. 
Marjorie Senechal Smith 
College, Northampton, 
Massachusetts, USA. 
senechal@smith.edu 


be permanently ruined. But by 
getting off useless committees, 
removing junk-work from 

my schedule, delegating and 
becoming more efficient, | 
found myself producing just 
as much science half-time as | 
had been full-time. And there 
is a reason why sabbaticals are 
built into the academic career: 
we all need time to step back 
and re-evaluate, and spending 
time with our children can 
provide just this opportunity. 


Anna Sutton says: 

You know what's not helping? 
Quoting women saying “it’s like 
not getting asked to dance”. 

No wonder we're not asked to 
participate in technical and 
scientific activities if all we 

can do is focus on immature 
emotional reactions. 
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OBITUARY 


Donald Arthur Glaser 


(1926-2013) 


Physicist and biotechnologist who invented the bubble chamber. 


onald Arthur Glaser was a great 
D physicist and one of the first bio- 

technology entrepreneurs. He was 
also the least arrogant, most delightfully 
funny person you could hope to meet. He 
was aged just 25 when he designed and 
built the first bubble chamber, a device that 
allowed physicists to visualize with unprece- 
dented precision the trajectory of elementary 
particles, and so measure their properties. 

Glaser, who died on 28 February, was born 
in 1926, in Cleveland, Ohio. He received a 
bachelor’s degree in physics and mathemat- 
ics in 1946 from Cleveland’s Case School of 
Applied Science (now Case Western Reserve 
University). As a child, he loved the arts — 
especially music — and while at Case he 
played the viola in the Cleveland Philhar- 
monic Orchestra. 

After his degree, Glaser started a PhD in 
physics at the California Institute of Tech- 
nology (Caltech) in Pasadena. His interest in 
particle physics led him to work with Nobel 
laureate Carl Anderson, studying cosmic 
rays. To do this, he used a cloud chamber, 
an early particle detector that was essentially 
a sealed chamber filled with vapour. Dur- 
ing these years, Glaser learned to design 
and build the equipment he needed for his 
experiments, a skill that served him well 
when he became an assistant professor at 
the University of Michigan in Ann Arbor 
in 1949, 

It was at Michigan that Glaser built the 
first bubble chamber, an improvement 
on the cloud chamber. Whereas the cloud 
chamber was filled with vapour, the bub- 
ble chamber is filled with liquid. A piston 
produces a sudden decrease in pressure, 
forcing the liquid into a superheated state. 
Charged particles create an ionization track 
around which the liquid vaporizes, forming 
microscopic bubbles; the density of bubbles 
is proportional to the particle’s energy loss. 
The device had a profound effect on parti- 
cle physics, enabling the use of ever larger 
accelerators. 

Glaser possessed a charming mix of 
curiosity and innocence. For a physicist 
considering how to visualize elementary 
particles, a natural step would have been 
to consult the relevant chapter in Enrico 
Fermi’ 1937 book Thermodynamics, the 
reference text for particle physics. But 
instead Glaser worked through his own cal- 
culations from scratch. Had he read Fermi’s 
book, he would have seen an equation that 
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implied that the bubble chamber could not 
work. Fermi had an iron-clad reputation 
for not making mistakes, but he had made 
one here, leaving the door open for Glaser’s 
great invention. 

In 1960, at the age of just 34, Glaser 
received the Nobel Prize in Physics for his 
work on the bubble chamber. Glaser’s super- 
visor Anderson was the student of Robert 
Millikan, another physics Nobel prizewinner 
from Caltech. According to Glaser, his 
students used to look at one another and 
wonder who would be the fourth in this dis- 
tinguished line. In 1993, the Nobel Prize in 
Chemistry did go to someone with whom 
Glaser had worked. That person was bio- 
chemist Kary Mullis, who received the prize 
for inventing the polymerase chain reaction 
(PCR), which can amplify one or a few pieces 
of DNA into thousands or millions of cop- 
ies. Yet the connection between Glaser and 
Mullis had nothing to do with physics, but 
biotechnology. 

Glaser became professor of physics at the 
University of California, Berkeley, in 1959. 
However, he grew tired of particle physics, 
which increasingly involved large teams of 
investigators using big instruments, and 
became professor of molecular biology 
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in 1964. His passion and skill at building 
equipment resulted in several new methods 
for automating large-scale hunts for phar- 
maceutically valuable mutants in organisms 
such as bacteria. He then did something 
that was unusual at the time. In 1971, with 
Ron Cape and Peter Farley, he co-founded 
Cetus, the first US biotechnology company, 
becoming chairman of its science advisory 
board. This was where Mullis discovered 
PCR, and other researchers synthesized 
interferon, a protein that helps to medi- 
ate the immune response. The company 
merged with Chiron in 1991 and was later 
purchased by Novartis. 

As molecular biology became increas- 
ingly about biochemistry, Glaser again 
considered a career change. Partly because 
of his love of visual art, and because of 
problems he had encountered while train- 
ing workers to recognize bubble-chamber 
tracks or assess the growth of cultures, he 
became fascinated by human vision. It was 
through this new interest that I met Glaser 
in 1981, and again in 1982 in Cambridge, 
Massachusetts, during his mini-sabbatical 
at the Rowland Institute — now part of Har- 
vard University — where Edwin Land, the 
founder of Polaroid, was studying human 
perception of colour. 

In the early 1980s, Glaser worked on 
visual psychophysics — a body of quantita- 
tive methods to measure perception — and 
on computational modelling of the visual 
system. From 1989, his position at Berkeley 
was professor of physics and neurobiology. 

Don and I had stayed in close touch since 
1981. Iloved our discussions and the obser- 
vations he made, which were simple, crisp 
and deep. It is both a privilege anda burden 
to win a Nobel prize at such a young age. 
And I always admired the grace with which 
Don carried his success in science and in 
industry. Somehow he retained the happy 
curiosity of a child: the world for him was 
a garden of wonders. He was able, in an 
apparently effortless way, to come up with 
refreshingly counterintuitive observations 
of things and of people. m 
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Positive and radical 


Biosynthesis of the antibiotic fosfomycin ends with the formation of an ‘epoxide’ ring structure. The epoxide is suggested 
to form from a cationic intermediate, rather than from a free radical. SEE LETTER P.114 
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chemistry are often mediated by 
metalloenzymes that catalyse single- 
electron chemistry. In organic sub- 
strates, such reactions are typically 
thought to involve radicals (in which 
a carbon atom has one unpaired 
electron), but detecting these inter- 
mediates has been challenging 
because they are highly reactive. 
The non-haem iron-dependent 
enzymes form a large class of such 
metalloenzymes'. On page 114 of 
this issue, Chang et al.” provide data suggesting 
that at least one enzyme in this class uses both 
a carbon-centred radical and a carbocation 
as intermediates during the biosynthesis of 
fosfomycin, an antibiotic used in clinical 
settings and produced by several bacterial 
strains. 

Fosfomycin contains a reactive epoxide 
group that is essential for its mode of action 
(Fig. 1a). The introduction of this oxygen- 
containing group during biosynthesis takes 
place through an unusual reaction catalysed 
by the enzyme (S)-2-hydroxypropylphospho- 
nate epoxidase (HppE). Typically, the oxygen 
atom in naturally occurring epoxides derives 
from molecular oxygen (O,),and is added to a 
carbon-carbon double bond in the epoxide's 
precursor molecule. However, in vivo experi- 
ments’, as well as in vitro incubation’ of HppE 
with isotopically labelled (S)-2-hydroxypropyl- 
phosphonate, ((S)-2-HPP, the enzyme’s sub- 
strate), have demonstrated that fosfomycin’s 
epoxide oxygen derives from the alcohol group 
(OH) in 2-HPP. Since the first purification and 
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Figure 1 | Reactions catalysed by the (S)-2-hydroxypropylphosphonate 
epoxidase (HppE) enzyme. a, HppE catalyses the formation of the 
antibiotic fosfomycin from (S)-2-HPP by forming an epoxide group 
(circled). The oxygen atom in the epoxide comes from an alcohol group 
(OH) in the starting material, rather than from the oxygen (O,) used as 
a reagent. NADH and FMN are cofactors. b, Chang et al.’ report that 
HppE converts a different substrate, (R)-1-HPP, into 1-oxopropan-2- 
ylphosphonate. The carbon atom shown with a blue dot ends up in the 
aldehyde group (CHO) of the product, and a hydrogen atom (known as 
the pro-R hydrogen; red) is lost. 


definitive assignment of HppE as the epoxidase 
that carries out this reaction’, the enzyme has 
been extensively studied to uncover the cata- 
lytic mechanism involved. 

HppE needs oxygen and two electrons from 
a reducing agent to convert 2-HPP to fosfomy- 
cin. The coenzyme NADH, in the presence of 
the catalytic cofactor flavin mononucleotide, 
can serve as the two-electron donor to activate 
oxygen for the reaction in vitro*. Enzymology” 
and structural studies® have allowed the for- 
mulation of a putative mechanism in which a 
reactive iron species at the active site of HppE 
removes a hydrogen atom from the substrate 
to generate a carbon-centred radical. It was 
postulated that the epoxide forms when this 
radical reacts with 2-HPP’s deprotonated alco- 
hol (the anion formed when the alcohol group 
has lost a proton, H’), which is bound to iron 
(Fig. 2a). The latter step could be described 
as an intramolecular version of the ‘oxygen 
rebound’ step that has been proposed for reac- 
tions catalysed by haem- and non-haem iron- 
dependent enzymes’. 
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Enzymatic reactions with sub- 
strates that involve one-electron 
chemistry are often well-controlled, 
and intermediates are difficult to 
detect because of their short-lived 
nature. Substrate analogues can be 
used to divert enzymes from their 
normal chemistry to an alternative 
outcome that can be diagnostic of a 
certain type of reactive intermediate. 
These analogues can therefore pro- 
vide indirect evidence of the presence 
of an intermediate during the normal 
reaction. 

In this spirit, Chang et al. discov- 
ered that incubation of HppE with the 
alternative substrate (R)-1-HPP gen- 
erated 1-oxopropan-2-ylphosphonate 
(Fig. 1b). This product indicates that 
the phosphonate group (PO,” ) in the 
substrate had moved to an adjacent 
carbon atom, in a process known asa 
1,2-migration. Although this type of 
migration of a phosphorus-contain- 
ing group has been observed before 
under ‘harsh conditions (such as with 
strongly basic reagents in organic 
solvents), its catalysis by an enzyme 
is unprecedented. 

Chang and colleagues investigated the 
possible mechanism for the formation of the 
unexpected product by solving the crystal 
structure of HppE in complex with (R)-1-HPP. 
The structure showed that one of the hydro- 
gens in the molecule (the pro-R hydrogen at 
the C2 position of (R)-1-HPP, Fig. 1b) is cor- 
rectly oriented for removal by a reactive iron- 
oxygen species. To obtain further support for 
the idea that this hydrogen is removed, the 
authors incubated the enzyme with (R)-1-HPP 
in which the pro-R hydrogen was replaced with 
a deuterium atom. The product of the result- 
ing reaction did not contain any deuterium, 
confirming the researchers’ hypothesis. The 
next question was: what type of reactive 
intermediate at C2 could go on to form the 
observed product? 

To address this, the authors performed 
model studies. They used well-established 
chemical methods to generate either a 
radical or a carbocation at C2 of their sub- 
strate analogue in the absence of HppE, and 
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Figure 2 | Proposed mechanisms of HppE-catalysed reactions. a, In an earlier proposed mechanism for 
the HppE-catalysed synthesis of fosfomycin (orange), epoxide formation occurred through an intermediate 
radical. Fe, iron (Roman numerals indicate the oxidation state of the iron); e , electron; Glu and His are the 
amino acids glutamate and histidine, respectively. The Fe=O species and the amino acids are part of the 
active site of HppE. The black dot is a single electron (the radical electron). Fish-hook arrows (red) indicate 
the movement of single electrons. b, Chang et al.” propose that in the final stages of the HppE-catalysed 
reaction of (R)-1-HPP, a one-electron transfer generates a carbocation, which rearranges in a 1,2-migration 
process to yield the product. Green arrows indicate movement of electron pairs. The phosphonate group 
(PO,”) moves from one carbon (blue) to the adjacent carbon (pink). c, The authors suggest that the final 
stages in the biosynthesis of fosfomycin might also involve a carbocation intermediate. 


evaluated whether these intermediates gave 
rise to the same product as the enzyme-cat- 
alysed reaction. Their results show that, in 
organic solvents, a carbocation will indeed 
rearrange to generate the same product as 
the enzymatic reaction (Fig. 2b), whereas 


the corresponding radical will not. 

Chang and colleagues’ data provide persua- 
sive support for a carbocation intermediate 
in the enzymatic reaction of (R)-1-HPP. The 
carbocation might be formed by the transfer 
ofan electron from an initially formed radical 


Western North 
America’s jigsaw 


The tectonic history of western North America is a puzzle in which many of the 
pieces are mashed up or missing. Seismic images of the deep Earth reveal features 
that alter our reconstruction of this puzzle in space and time. SEE ARTICLE P50 


SASKIA GOES 


he western margin of the North 
American continent, comprising the 
Cordilleran mountain chain that runs 
from Alaska to Mexico, consists of scores of 
continental jigsaw pieces of different origins, 
sizes and ages. They came together as Cor- 
dilleran North America was assembled in 
a slow dance of tectonic plates over the past 


350 million years’. It has been widely accepted 
that one large oceanic plate, the Farallon (and 
fragments thereof), acted as a conveyor belt, 
carrying all these pieces to North America’s 
western margin. When the plates converged, 
the dense oceanic plate sank into the mantle 
to form a slab below the lighter continent in 
a process called subduction. During subduc- 
tion, volcanic island chains or small continen- 
tal fragments on the oceanic plate were scraped 
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to a highly oxidizing iron(Iv) species (Fig. 2b). 
Further studies will be required to investigate 
whether such a carbocation intermediate is 
also formed during the reaction with 2-HPP 
(Fig. 2c). Moreover, Chang and co-workers’ 
finding raises the question of whether other 
non-haem iron-dependent enzymes that 
use similar substrates but achieve different 
transformations” also generate carbocation 
intermediates. Regardless of the answer, 
the discovery of a 1,2-phosphono migra- 
tion adds another reaction to the already 
impressive array of chemical transforma- 
tions that are catalysed by non-haem iron- 
dependent enzymes in general’, and by HppE 
in particular. = 
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off to become new North American real estate. 
In this issue, Sigloch and Mihalynuk’ propose a 
new way of putting the pieces together, involv- 
ing at least two additional oceanic plates. Their 
analysis is based on tomographic imaging of 
the deep mantle using earthquake waves, 
which reveals previously unseen features of 
the oceanic jigsaw. 

Cordilleran rocks document a varied sub- 
duction history’. Plate convergence along 
the western North American margin proba- 
bly started along an offshore subduction zone 
overlain by a chain of volcanoes arranged in 
an arc. The oceanic basin separating this vol- 
canic arc from the mainland may have initially 
been closed by westward dipping subduc- 
tion followed by eastward subduction below 
the continental margin. Several other intra- 
oceanic arcs and at least two larger ‘exotic’ 
terranes (blocks of crust deposited onto a 
plate), the Intermontane and Insular terranes, 
were subsequently accreted to North Amer- 
ica’s west coast. From the mid-Cretaceous 
geological period about 90 million years 
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Future 


Kula 


Figure 1 | Two potential plate scenarios for North America’s Cordilleran 
assembly. a, The previously proposed plate configuration (corresponding 
to 80 million years ago), in which an eastward-subducting and fragmenting 
Kula/Farallon oceanic plate, bordered by the Pacific plate, rafted in all 
Cordilleran terranes (brown) and produced a hook-shaped mantle slab 
below the North American continent. A shallower section of the Kula plate 
formed a western extension of the hook-shaped slab. Only a remnant of 
the Farallon plate known as Juan de Fuca is currently subducting below 
North America. b, Sigloch and Mihalynuk’s newly proposed scenario” 


ago, a continental volcanic arc formed on the 
Cordilleran margin, sections of which con- 
tinue to be active today. Sigloch and Mihalynuk 
link their tomographic imaging record of the 
deep mantle to the past 100-200 million years 
of Cordilleran assembly. 

Reconstructing the assembly history of 
continental terranes is challenging because 
when plate slivers attach to a continent they 
are deformed and partly dismembered, and 
are further reshaped when additional pieces 
arrive. By contrast, the denser oceanic plates 
tend to remain largely intact during their tec- 
tonic cycle. Subducted oceanic-plate segments 
in the mantle therefore present a simpler image 
of plate history, although only mantle features 
with dimensions of 100 kilometres or larger 
can be resolved by earthquake tomographic 
imaging. 

Sigloch and Mihalynuk’s images reveal three 
major mantle slabs at a depth of between 1,000 
and 1,800 km; the western slab stands alone 
and the two eastern ones are imaged as a con- 
tinuous hook-shaped feature. This hook has 
previously been interpreted*” as the result of 
the eastward subduction of a Farallon plate 
that broke at around 80 million years ago into a 
northern segment (the Kula plate) and a main 
Farallon core (Fig. 1). In previous tomographic 
images, the western slab was prominent only 
at depths above 1,500 km and was interpreted 
as a shallower section of a mildly dipping Kula 
slab*. An even shallower western slab was 
linked to a further (Juan de Fuca) fragment of 
the northern Farallon plate, which is currently 
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subducting at the Cascadia trench just offshore 
of the northwestern United States. 
Surprisingly, the western feature in Sigloch 
and Mihalynuk’s images seems to be a North 
Farallon slab that is continuous between the 
present-day Cascadia trench and a depth of 
1,800 km. They therefore propose that the other 
deep slabs belong to non-Farallon oceanic 
plates, which they name the Angayucham 
and Mezcalera plates. These would have been 
attached to the North American continent and 
subducted westwards below a set of terranes 
that separated them from the Farallon plate 
(Fig. 1). Terrane docking ages, obtained from 
radiometric and fossil dating of the terrane 
rocks, require that the hook slabs predate the 
separation of the Kula and Farallon plates by 
about 60 million years, implying lower slab- 
sinking velocities and faster North American 
motion before 140 million years ago than if the 
imaged fragments were Kula-Farallon slabs. 
The position of the features attributed to the 
South Farallon and Mezcalera slabs in the new 
tomographic images requires that the eastward 
subduction of the South Farallon ocean floor 
started at about the same time as the closure of 
the Mezcalera Ocean (the ocean underlain by 
the Mezcalera plate) was completed, deposit- 
ing this Farallon slab on top of the Mezcalera 
one. Such subduction polarity switches have 
been proposed in Cordillera reconstructions 
before, as has the presence of the Mezcalera 
Ocean’. These subduction flips are usually 
attributed to the closure of small marginal 
basins, although a few studies have proposed 
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(corresponding to 140 million years ago), in which two westward-dipping 
oceanic plates, the Angayucham and Mezcalera, formed the hook-shaped 
slab, and a western slab formed by the North Farallon plate. Both scenarios 
are based on subducted slabs currently imaged in the deep mantle at 

a depth of 1,000-2,000 km. Divergent plate boundaries between the 

Pacific, Kula and Farallon plates are displayed as double lines; convergent 
boundaries are shown as lines with teeth in the direction of the downward- 
moving plate. The grey outlines show the side views of the left section of the 


that most of ‘Cordilleria assembled as a ribbon 
continent separated from North America bya 
large ocean®. None of these proposed oceans 
has previously been identified in the mantle 
record. 

Sigloch and Mihalynuk have incorporated 
several new pieces into the North American 
jigsaw, but others remain to be placed. The 
authors have imaged several more westerly 
slabs that may require further intraoceanic 
subduction zones, for example. Some of these 
slabs have previously been interpreted as a 
record of further fragmentation of the Farallon 
plate below the southwestern United States’. 
Other studies have imaged slab material 
farther south and deeper in the mantle’, and 
these may be remnants of subduction before 
150-200 million years ago. Could some 
of these correspond to a previously pro- 
posed' period of eastward subduction of the 
Mezcalera plate? 

The new pieces remain to be connected 
into a self-consistent framework of plates 
with continuous and dynamically evolving 
boundaries. Subduction polarity flips are 
commonly associated with the arrival of buoy- 
ant terranes at the trench, for example at the 
closure of an ocean. What stopped earlier east- 
ward subduction? Did the Mezcalera Ocean 
contain a ridge that clogged this subduc- 
tion and left ocean-floor fragments stranded 
among the Intermontane terranes? Another 
intriguing question is whether a westward- 
subducting slab pulling on the North Ameri- 
can plate might have aided the opening of the 


Central Atlantic, which occurred in the same 
time frame as the proposed Angayucham- 
Mezcalera subduction. Many questions about 
the region's tectonic history remain, but this 
work highlights how tomographic images are 
providing increasingly well-defined pieces of 
the puzzle, complementing the oceanic and 
continental rock record. = 
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of crystal defects 


A clever combination of existing techniques has produced three-dimensional 
atomic images of individual platinum nanoparticles, disclosing the atomic 
structure of crystal defects within them. SEE LETTER P.74 


PATRICK J. MCNALLY 


rystals without defects are the ‘Stepford 

wives of materials science: all too per- 

fectly, almost robotically, ordered, and 
relatively uninteresting. However, when one 
adds defects to these perfectly ordered solid- 
state arrangements of atoms or molecules, the 
materials acquire useful attributes, including 
the altered conducting or optical proper- 
ties of semiconductor devices. But although 
well-tested theories exist to account for these 
defects at the atomic level, imaging them, and 
in particular doing so non-destructively, has 
proven a challenge. Chen et al.' (page 74 of 
this issue) have now made remarkable strides 
in this direction, producing three-dimensional 
(3D) images at atomic resolution of a number 
of structural defects inside a solid crystalline 
platinum nanoparticle’. 

Crystallography has a distinguished history 
of using macro-observational data — for 
example, measuring the interfacial angles in 
naturally occurring crystals such as rock salt 
or pyrite — to infer the existence at the atomic 
scale of 3D ordering in these solids. In the late 
eighteenth and early nineteenth centuries, 
French mineralogist René-Juste Hatiy postu- 
lated that these naturally occurring crystals 
were composed of well-ordered assemblages 
of polyhedra, each of which was composed of 
elemental atoms”. Almost exactly 101 years 
ago, this was confirmed in the famous experi- 
ment? of Friedrich, Knipping and von Laue, 
in which a beam of Wilhelm Réntgen’s newly 
discovered X-rays was directed at samples 


*This article and the paper under discussion‘ were 
published online on 27 March 2013. 


of crystalline copper sulphate and zinc sul- 
phide. The X-rays did not just pass through 
these crystals, but rather diffracted through 
them, producing ordered arrays of spots on 
a recording film. The location of these spots 
on the film corresponded to diffraction of the 
X-rays by individual, ordered atomic planes”, 
and confirmed the existence of atomic order- 
ing in crystals. 

Research in the intervening century has 
often tended to focus on the imperfections 
— the breakdown of local order — in these 
atomic arrays. Such imperfections can take 
the form of missing atoms (vacancies), atoms 
sitting in small gaps in these ordered arrays 
(interstitials) and many extended defects, 
of which the most common are dislocations 


Figure 1 | Edge dislocation in a cubic crystal. 
This type of crystal defect is caused by the 
insertion of an extra atomic plane. Chen et al.’ 
imaged edge dislocations at atomic resolution in 
a platinum nanoparticle that has a decahedral, 
rather than a cubic, atomic lattice. 
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(essentially, micro-cracks following specific 
allowed paths within the crystals) and inter- 
faces including low-angle grain bounda- 
ries and twins’. Researchers have imaged 
these dislocations using optical or electron 
microscopy, or X-ray-diffraction imaging 
techniques. Although they can image the 
extent of dislocations®”’, such techniques are 
incapable of imaging or producing meaning- 
ful measurements of the atomic structure of 
these defects. 

The work by Chen et al. builds on a tech- 
nique known as electron tomography. In this, a 
two-dimensional (2D) slice image of the sam- 
ple under test is acquired using a fixed angle 
between the sample and the electron beam 
that passes through and undergoes diffrac- 
tion. Thereafter, the sample is tilted at differ- 
ent angles with respect to this “zero position 
and a series of these 2D projections is recorded. 
The 2D images are then combined to produce a 
3D image of the object. Until now, the ultimate 
spatial resolution of the 3D image has been 
limited by the steps in the sequential tilting of 
the sample, although this has been improved 
on by the development of what is known as 
equally sloped tomography (EST). In this 
approach, the tilt series of images is acquired 
by altering the tilt angle using an algorithm 
that ensures that the alteration occurs with 
equal slope increments*”. 

Another problem commonly encountered 
is that the acquired images are noisy (they 
possess a low signal-to-noise ratio), which 
greatly impedes the 3D rendering and impairs 
the spatial resolution. This signal-to-noise 
problem can be lessened by using mathemati- 
cal operations known as Fourier transforms. 
These are used to relate the 3D x-y-z posi- 
tional coordinates of the recorded image 
pixels to the 3D occurrence of their spatial- 
frequency components. Smooth variations 
in the pixel intensities across the image are 
termed low-frequency variations, whereas 
sharper, more regularly occurring ones 
are termed high-frequency variations. The 
Fourier-transform methods can extract this 
frequency information and filter those com- 
ponents that have a deleterious impact on the 
signal-to-noise ratio. 

To overcome the signal-to-noise problem 
in their work, Chen et al. have subjected the 
EST images to Fourier transformation and 
used a 3D Fourier filtering method that iden- 
tifies the Bragg peaks recorded on the EST 
images (Bragg peaks are electron-diffraction 
peaks that are the equivalent of the most com- 
monly occurring spatial frequencies). This 
method also cleans up the data around the 
peaks and applies an inverse Fourier trans- 
form that converts the data back into the 
x-y-z space to produce spatially sharper 3D 
reconstructions. 

The major contribution of Chen and col- 
leagues’ work is that it combines techniques 
that are already available, namely EST and 3D 
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Fourier filtering, to produce images of most of 
the platinum nanoparticle, which has a volume 
of approximately 850 cubic nanometres. The 
authors have proved the existence of atom- 
size steps at the boundaries between regions 
known as twins inside the nanoparticle (these 
twins comprise regions in which the atomic 
ordering within each twin is perfect but the 
twins themselves differ from each other in 
their local atomic arrangements). In addition, 
they have produced what I believe are the first 
images of the 3D atomic structure right at the 
core of two types of well-known dislocation — 
edge and screw dislocations — in the nanopar- 
ticle. Many of us remember materials-science 
lectures and textbooks in which we saw clas- 
sical 3D block images of an edge or a screw 
dislocation (Fig. 1), but it is remarkable to see 
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this realized for individual atoms (see Figs 3 
and 4 of the paper’). 

Looking to the future, of particular interest 
for this 3D Fourier electron-tomography tech- 
nique would be the development of a capabil- 
ity to map the exact location of every atom in 
a crystal. This is achievable to some extent in 
macromolecular or protein crystallography, 
although in these situations researchers have 
a priori knowledge of the molecular peptide 
sequence and stereochemistry, which helps 
greatly in refining the atomic positional mod- 
elling. The extension of 3D Fourier electron 
tomography to structures larger than nano- 
particles should also be possible in principle, 
because the technique seems to be mainly con- 
strained by electron scattering, and so imaging 
should depend chiefly on the electron-beam 
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Mutations in the enzyme isocitrate dehydrogenase lead to the accumulation ofa 
metabolite that seems to promote cancer by influencing the epigenetic status of 
cells. But the effects are reversible, hinting at therapeutic targets. 


ABIGAIL S. KRALL & HEATHER R. CHRISTOFK 


r | The discovery of cancer-associated muta- 
tions in metabolic enzymes has fuelled 
the idea that altered cellular metabo- 

lism may be involved in the development 
of cancer. For example, mutations in isoci- 
trate dehydrogenases 1 and 2 are commonly 
detected in brain tumours and leukaemia. 
These mutations cause the enzymes to pro- 
duce the (R)-enantiomer of a metabolite called 
2-hydroxyglutarate, which can accumulate in 
tissues!”. Writing in Science, Losman et al? 
now show that the presence of this metabo- 
lite alone can promote cancer-like properties 
in cells. Notably, they also demonstrate that 
its cancer-promoting effects are reversible, 
suggesting that therapies targeting isocitrate- 
dehydrogenase mutants might improve patient 
prognoses. 

Isocitrate dehydrogenases 1 and 2 (IDH1 
and IDH2) function in the cytoplasm and in 
the citric-acid cycle, respectively, to convert 
isocitrate to a-ketoglutarate (Fig. 1). More 
than 75% of brain tumours and around 20% 
of acute myeloid leukaemias harbour muta- 
tions in IDH1 or IDH2*” that confer on the 
enzymes the gain-of-function ability to con- 
vert a-ketoglutarate to (R)-2-hydroxyglutarate 
((R)-2HG)'”, which suggests that these muta- 
tions transform cells in ways that contribute to 
tumour development. 
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Losman and colleagues tested this idea by 
expressing a cancer-associated mutant ver- 
sion of IDH1 in immature red blood cells. 
They observed that the cells were able to grow 
in the absence of certain, usually essential, 
growth factors, and that their differentiation 
into mature cells was impaired — two hall- 
marks of leukaemic cells. Moreover, the addi- 
tion of cell-membrane-permeable (R)-2HG 
to cells without an IDH mutation had the 
same effects, indicating that this metabo- 
lite is the transforming agent — a bona fide 
‘oncometabolite’ 

Dioxygenase enzymes that use a-keto- 
glutarate as a cofactor function in diverse 
cellular processes, such as adaptation to low 
oxygen levels or epigenetic modifications 
(alterations, such as methylation, to DNA 
or associated histone proteins that regulate 
gene expression without changing the DNA 
sequence). (R)-2HG, which structurally 
resembles a-ketoglutarate, promotes the activ- 
ity of some of these enzymes by substituting 
for a-ketoglutarate®, and inhibits others by 
competing with a-ketoglutarate for active-site 
binding’. For example, (R)-2HG inhibits the 
dioxygenase TET2 (ref. 7), whose activity leads 
to reduced DNA methylation. It also inhibits 
some histone demethylase enzymes, such as 
KDM enzymes”®. Inhibition of these epige- 
netic modifiers results in DNA and histone 
‘hypermethylation signatures’ that can alter 
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energy and the composition of the material 
under test. m 
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gene expression to block cellular differentia- 
tion’. IDH mutations are therefore thought 
to promote cancer by modifying the epige- 
nome to maintain cells in a stem-cell-like state. 

In a series of compelling experiments, 
Losman et al. provide evidence that (R)-2HG 
transforms cells by inhibiting the activity of 
TET2. They find that blocking TET2 activ- 
ity in cells recapitulated (R)-2HG-mediated 
growth-factor independence and impaired 
differentiation, reinforcing the suggestion that 
TET2 inhibition and the resulting epigenetic 
changes contribute to (R)-2HG-induced 
transformation. 

However, the authors also identify one 
a-ketoglutarate-dependent dioxygenase whose 
activity must be maintained for (R)-2HG- 
mediated leukaemic transformation. This is 
EGLN, an enzyme that marks the transcrip- 
tion factor HIF-a for degradation (Fig. 1). 
This piece of the puzzle was elucidated when 
Losman et al. observed that the (S)-enantiomer 
of 2-hydroxyglutarate, (S)-2HG, does not pro- 
mote leukaemia development, even though it 
inhibits TET2 and histone demethylases more 
potently than does (R)-2HG*”’. Therefore, 
it seemed probable that (R)-2HG promotes 
cellular transformation by modulating the 
activity of another dioxygenase. 

It was recently demonstrated that (R)-2HG 
activates EGLN, whereas (S)-2HG inhibits it®. 
Consequently, Losman et al. reasoned that leu- 
kaemic transformation resulting from mutant 
IDH or (R)-2HG might require EGLN activ- 
ity. Confirming this hypothesis, they observed 
that decreasing EGLN expression blocked 
transformation by mutant IDH or TET2 loss. 
Although the authors show that TET2 inhibi- 
tion is sufficient for transformation and that 
EGLN activity is necessary for (R)-2HG-medi- 
ated transformation, whether TET2 inhibition 
is necessary for transformation by (R)-2HG 
remains to be tested. 

The dependence of mutant IDH on EGLN 
activity during leukaemia development is sur- 
prising, as it contradicts the traditional view 
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Figure 1 | An oncogenic metabolite. Mutations in the metabolic enzyme isocitrate dehydrogenase 
(IDH), which converts isocitrate to a-ketoglutarate can lead to accumulation of the normally irrelevant 
metabolite (R)-2HG. Losman et al.* show that the presence of (R)-2HG causes cells to become 
independent of growth factors and to lose the ability to progress from a stem-cell-like progenitor to a fully 
differentiated state — two hallmarks of leukaemic cells. The authors’ data suggest that this occurs because 
the metabolite impairs the activity of a-ketoglutarate-dependent enzymes such as TET2 and KDM, which 
promote differentiation by removing methyl groups from DNA and DNA-associated histone proteins 

to alter gene expression. The authors also show that (R)-2HG-mediated cellular changes depend on the 
activity of EGLN, another enzyme that uses a-ketoglutarate as a cofactor. EGLN marks the transcription 
factor HIF-a for degradation. However, HIF-a is typically associated with cancer promotion, so the role of 
EGLN in (R)-2HG-mediated cancer promotion remains unclear. 


of HIF-1a as a cancer-promoting protein. 
However, in support of Losman and col- 
leagues’ findings, previous studies have 
shown that HIF-1a can inhibit leukaemic-cell 
proliferation and induce the differentiation 
of myeloid cells’*. This occurs in the absence 
of its typical transcription-factor partner 
HIF-1£, and seems to depend instead on 
CEBPa, a transcription factor that is crucial 
for blood-cell differentiation’. Further stud- 
ies are needed to determine whether HIF-a 
loss is a major factor in (R)-2HG-mediated 
transformation. 

Importantly, Losman et al. also demonstrate 
that the oncogenic effects of (R)-2HG are 
reversible — its removal restored the growth- 
factor dependence and differentiation capacity 
of cells even after long-term exposure to the 
metabolite. This reversibility is a key observa- 
tion because mutant IDH may transform cells 
by altering the epigenome. The stable, inher- 
itable nature of epigenetic modifications had 
raised concerns that targeting mutant IDH 
would not be therapeutically effective. But if 
(R)-2HG removal also results in reversion of 
hypermethylation signatures, this would indi- 
cate that the epigenetic modifications relevant 
to mutant IDH-mediated transformation are 
more dynamic than previously thought. Alter- 
natively, however, the features of transformed 


cells may arise from atypical functions of 
epigenetic modifiers such as TET2, or solely 
from the activities of other enzymes affected by 
(R)-2HG, such as EGLN. 

Mounting evidence suggests that metabo- 
lism and cellular epigenetic states are intercon- 
nected: because many epigenetic modifiers 
depend on metabolites as cofactors or sub- 
strates, the cell’s metabolic state can influ- 
ence its epigenetic state. Moreover, epigenetic 
modifications regulate many cellular pro- 
cesses central to cancer development. The 
study by Losman et al. bridges the fields of 
cancer, metabolism and epigenetics by show- 
ing that a metabolite that results from a genetic 
abnormality can promote cancer, possibly by 
changing the epigenetic landscape of the cell to 
alter gene expression. Continued study of the 
dynamic interplay between metabolism and 
epigenetics in cancer should lead to improved 
understanding of disease development and 
better therapeutic options. m 
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The same, but better 


Moving beyond mimicry, biologically inspired artificial materials can be simpler 
in design yet more powerful in function than their natural analogues. A tropical 
fruit seed serves as a guide to making new photonic elements. 


TERI W. ODOM 


ature’s bounty has provided the 
| \ | inspiration for a wide range of 
man-made photonic materials. For 
example, opals have given rise to artificial 
crystals made of colloidal building blocks’, 
and the starfish-related brittlestar has shown 
how microlenses can act like a compound eye’. 
Now, as Kolle et al. report’ in Advanced Mater- 
ials, tropical plant seeds can be added to the 
list, and with them, a new biologically inspired 
photonic element — the soft optical fibre. The 
seeds of the rainforest plant Margaritaria nobi- 
lis look blue-green because of the way in which 
light interacts with the cells in the plant’s seed 
coat, which are elongated, with a cross-section 
rather like a Swiss roll in structure (Fig. 1a,b). 
This hierarchical architecture, in which the 
overall curvature of the cell is microscale and 
the concentric features within the cell are 
nanoscale, provided the guiding principles for 
the authors’ design of a multilayered photonic 
fibre (Fig. 1c). 

What is significant about Kolle and 
colleagues’ work is that they did not simply 
mimic the cellular structure of the coat, and 
hence obtain similar blue-green colours. 
Instead, they created a structure that had 
a simpler geometry which could not only 
produce blue and green colours, but also 
generate tunable colours over the visible 
spectrum. The key to this greater flexibility 
resides in the use of soft materials and in the 
fabrication method. 

The colours of the seed coat are determined 
by the thickness and refractive index of the 
two alternating materials of the Swiss-roll 
structure, whereas the brightness is controlled 
primarily by the number of bilayers in the roll. 
When light shines on a multilayered stack of 
two alternating materials, the reflected col- 
our — whose frequency falls within a range 
of ‘forbidden’ frequencies called the photonic 
bandgap — is preserved only at angles perpen- 
dicular to the surface. But because the cells in 
the seed coat have microscale curvature in 
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addition to the concentric nanoscale multi- 
layers, and the fruit itself has macroscale 
curvature, the blue-green colours can be 
observed over a wider range of angles. 

To create a bilayer with different refractive 
indices, Kolle et al. used two polymers, poly- 
dimethylsiloxane (PDMS) and polystyrene- 
polyisoprene triblock copolymer (PSPI). The 
thicknesses of the two polymer layers deter- 
mined the central frequency of the photonic 
bandgap. For example, a PDMS film 240 nano- 
metres thick and a PSPI 100 nm thick that were 
rolled around a 14-micrometre-diameter glass 
rod nearly 80 times resulted in a green fibre 
(Fig. 1d). With its circular cross-section, this 
soft fibre could reflect green colour without 
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needing the structural complexities of the seed 
coat, such as the cells’ elliptical cross-section 
and the fine structure within their nanoscale 
layers. 

Kolle et al. took advantage of the elastic 
properties of the polymer layers to tune the 
optical properties of the soft fibre. After the 
glass rod was removed, the now-hollow fibre 
could be mechanically deformed. Stretching 
along the axis of the fibre was accompanied by 
compression in the radial direction according 
to Poisson’s ratio for elastic materials under 
strain. Therefore, the authors could tune the 
fibre colour from the starting colour to shorter 
wavelengths simply by stretching. Tuning 
the fibre colour to longer wavelengths could 
potentially be achieved by inducing swelling, 
and thereby increasing the thickness of the 
polymer layers, by exposure to organic sol- 
vents. This type of flexibility is not possible in 
the tropical fruit seed coat or in other natural 
structures made from hard materials. 

Despite the potential of these soft photonic 
fibres, three main challenges remain: colour 
uniformity, colour purity and scalability. For 
the first problem, when the fibre is stretched 
along its axis, the strain is not uniformly 


Figure 1 | Fruit-inspired optical fibres. a, The fruit of the tropical plant Margaritaria nobilis without 
its capsule (about 10 mm in diameter). b, Cross-section of the fruit seed coat shows a concentric, 
multilayered cellular structure. c, Cross-section ofa soft multilayered optical fibre around a glass core, 
fabricated by Kolle and colleagues’. d, Reflected green colour from the soft optical fibre in c. 
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distributed along the length of the fibre and is 
most different where the fibres are clamped for 
stretching. Because the strain is non-uniform, 
the reflected colour will be also. In addition, 
if there is any slippage between the bilayers as 
the fibre is strained, there will be non-uniform 
colour in random areas. For the second issue, 
having a circular rather than an elliptical cross- 
section results in a larger gradient in colour 
variation. This could be overcome by start- 
ing the rolling process with a rod that has an 
elliptical or rectangular cross-section. For the 
third concern, the rolling of bilayers around a 
glass-rod template is cumbersome, and the rod 
must later be removed to obtain tunable col- 
our. This might be addressed by pre-straining 
the bottom layer in the bilayer such that the 
released strain would result in self-rolling with 
the appropriate inner diameter. 

Overall, however, the versatility of Kolle 
and collleagues’ fabrication approach bodes 
well for a range of future applications. For 
example, if insulator-metal bilayers can be 
rolled, and if the free-standing metal layer 
were to contain nanoscale patterns’, then 
fibre metamaterials could be formed. Such 
fibres could increase speeds for telecommuni- 
cations and sensing. In addition, the authors’ 
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technique could be used to make multilayer 
claddings out of many more combinations of 
materials than those limited by standard fibre 
drawing, which can be done only at high tem- 
peratures’. The room-temperature process- 
ing of the soft fibres is a distinct advantage. 
Finally, these flexible colour fibres could be 
woven into garments used for camouflage by 
subjecting them to external stimuli such as 
strain or volatile solvents. The re-engineering 
of nature’s resources for photonic materials 
with enhanced capabilities has never looked 
more fruitful. = 
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Sirtuin on a high-fat diet 


It emerges that the sirtuin enzyme SIRT6 preferentially removes long-chain 
fatty-acyl, rather than acetyl, protein modifications. This activity regulates 
secretion of the inflammation-associated protein TNF-a. SEE LETTER P.110 


POONAM BHEDA & CYNTHIA WOLBERGER 


. proces regulate a broad range of 


processes, including transcription, 
metabolism, fat mobilization, neuro- 
degeneration and ageing. The various func- 
tions of these proteins have been largely 
ascribed to their ability to catalyse the removal 
of acetyl groups from the lysine amino-acid 
residues of other proteins through their 
deacetylase activity. Yet the exact biologi- 
cal action of sirtuins remains unclear. For 
instance, one sirtuin, SIRT6, which has been 
implicated in genome stability, inflammation, 
cancer-cell metabolism and even lifespan, is 
a very weak deacetylase’. On page 110 of this 
issue, Jiang et al.’ report the surprising discov- 
ery that SIRT6 robustly removes a myristoyl 
group — a long-chain fatty-acyl group — from 
lysine residues, and that this biochemical activ- 
ity enables the enzyme to regulate the secretion 
of TNF-a, a cytokine protein released from 
cells during inflammation. 
Proteins undergo a diverse array of chemical 
modifications that modulate their activity. The 


enzymes that add and remove these modifi- 
cations are therefore key decision-makers in 
signalling cascades. The lysine side chains of 
proteins can be modifed by attachment of a 
small acyl group called acetyl, which is one of 
the most common regulatory modifications 
and is best known for its role in controlling 
transcription. Other, larger acyl modifications 
of lysine residues have been detected, although 
their biological roles are largely unknown. 
Most sirtuins deacetylate lysine side chains 
in a reaction that consumes the cofactor 
nicotinamide adenine dinucleotide (NAD‘*), 
releasing nicotinamide and transferring the 
acetyl group from the lysine to the remain- 
ing ADP-ribose moiety of NAD* to form 
O-acetyl ADP-ribose (Fig. 1). Most of the seven 
human sirtuins (SIRT1-7) exhibit this typi- 
cal activity, although some, including SIRT6, 
have either weak or no deacetylase activity. 
For instance, SIRT5 preferentially binds to 
and removes succinyl and malonyl modifica- 
tions from lysine’. These acyl groups are larger 
than acetyls and, unlike them, are negatively 
charged, but they are linked to lysine by the 
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50 Years Ago 


It is surprising to find that despite 
the widespread attention directed 
to the United Nations conference 
last February on the application 
of science and technology for 

the benefit of the less-developed 
areas, the admirable issue “World 
of Opportunity” of the New 
Scientist of February 14 still 
appears to remain the only one 

in which that conference has 
been at all adequately handled 

by British periodicals. This 
comparative neglect, in spite of 
copious literature distributed by 
the Department of Scientific and 
Industrial Research, in no way 
detracts from the excellence of the 
issue of the New Scientist which is 
as welcome for its enterprise as 
for its intrinsic and timely 

merit. 

From Nature 6 April 1963 


100 Years Ago 


Disastrous floods have followed 
the severe windstorms in the 
United States on March 23. The 
areas chiefly affected are the 
middle western States ... Immense 
tracts of country have been 
submerged, and many large towns 
have become flooded. Much of 
the ground is below the flood 
level of the rivers, and in parts the 
embankments have given way, 
whilst many tributary rivers have 
overflowed their banks. Dayton, 
Indianapolis, Columbus, and 
numerous other smaller towns 
have suffered immensely during 
the last week of March and much 
loss of life has occurred. Hundreds 
of houses have been washed 

away, and immense suffering has 
been caused ... The rivers are said 
to be still rising in many places, 
and the full result of the disaster 
will depend largely upon the 
weather for the next week 

or two. 

From Nature 3 April 1913 
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Figure 1 | Protein deacylation. Sirtuins are known to act as deacetylase 
enzymes. They remove the acyl group (R) acetyl from the lysine side chain 
of substrate proteins in a reaction involving the NAD* cofactor and 
transfer it to the remaining ADP-ribose moiety of NAD* (green) to form 


same type of chemical bond and are removed 
in the same NAD*-dependent enzymatic 
reaction as that catalysed by other sirtuins. 

The SIRT5 precedent prompted Jiang et al. 
to investigate whether the apparently weak 
deacetylase activity of SIRT6 also reflects a 
preference for other acyl-lysine substrates. 
Their search began in vitro with the use of 
chemically synthesized peptides bearing 
lysines modified with various acyl groups that 
are known to occur in cells. The result was 
clear: SIRT6 was far more active in removing 
the long-chain fatty-acyl myristoyl and pal- 
mitoyl groups than small acyl modifications, 
including acetyl groups. 

The authors’ crystal structure of SIRT6 
bound to a myristoylated peptide and ADP- 
ribose shows that SIRT6 contains an expanded 
hydrophobic channel in which it can accom- 
modate the 14-carbon myristoyl chain. 
Together with previous structural studies*” 
showing how different sirtuins can accom- 
modate various acyl modifications, includ- 
ing succinyl, malonyl and propiony] lysines, 
it now seems clear that structural features in 
the sirtuins’ active sites govern the preference 
of each one for removing a particular type of 
acyl modification. 

In vivo, SIRT6 is known to regulate the lev- 
els of TNF-a, which spans the cell membrane 
and is cleaved by a membrane-associated pro- 
tease enzyme, resulting in the secretion of this 
cytokine’s extracellular domain. The cytoplas- 
mic domain of TNF-a contains two myris- 
toylated lysines. This immediately raises two 
questions: does SIRT6 remove these myristoyl 
modifications and, if so, is this enzymatic activ- 
ity somehow connected to TNF-a regulation? 

Indeed, Jiang et al. found that the myris- 
toylation level of TNF-a in cultured cells 
depended on the enzymatic activity of SIRT6. 
Secretion of TNF-a also required SIRT6, 


42 | NATURE | VOL 496 | 4 APRIL 2013 


indicating that removal of its myristoyl groups 
is a key step in this process. It will be interesting 
to see how myristoylation regulates secretion, 
and whether removal of the myristoyl groups 
causes a conformational rearrangement in 
TNF-a that allows its cleavage by the mem- 
brane-associated protease. 

Jiang and colleagues’ findings set the stage 
for several new directions in which to inves- 
tigate the role of fatty-acyl modifications and 
their regulation by sirtuins. The paper should 
be the final impetus for reclassifying sirtuins 
as lysine deacylases’, and not simply deacety- 
lases, to reflect the more general nature of their 
enzymatic activity. 

Importantly, it now seems that sirtuins 
differ from one another in the type of acyl 
modification they preferentially remove 
from substrates, although current findings 
do not rule out the possibility that a given sir- 
tuin can remove several types of acyl modi- 
fication in vivo. This is perhaps the case for 
SIRT6. Although the enzyme preferentially 
removes long-chain fatty-acyl modifications, 
its deacetylase activity has been implicated’ in 
modification of the DNA-associated histone 
H3 protein. It could be that, in vivo, the weak 
deacetylase activity of SIRT6 is restricted to 
specific substrates (as has been shown‘), to 
precise subcellular localizations or to particu- 
lar signalling pathways. The fact that SIRT6 
can remove a broader spectrum of acyl modi- 
fications should make it possible to tease out 
the relative contributions of these biochemical 
activities to this enzyme’s function. 

It has long been a mystery why sirtuins con- 
sume the energetically costly NAD* cofactor, 
rather than — like other classes of deacety- 
lase — using simple hydrolysis to deacetylate 
substrates. Explanations invoked include the 
regulatory implications of coupling sirtuin 
activity to the cell’s metabolic state, in which 
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O-acetyl ADP-ribose. (Nicotinamide is a by-product of the reaction.) Jiang 

et al.” report that SIRT6 preferentially removes a different, long-chain acyl 
group (myristoyl) from proteins — a finding that, together with previous data, 
call for a reclassification of sirtuins as deacylases. 


NAD‘ is involved, or the possible signalling 
role of the O-acetyl ADP-ribose product 
(Fig. 1). Another explanation may be the need 
to generate a nucleotide carrier for the depart- 
ing acyl group. On transferring from the lysine 
substrate to ADP-ribose to form O-myristoyl 
ADP-ribose, the hydrophobic myristate is 
released in a more soluble conjugated form. 
The O-myristoyl ADP-ribose conjugation also 
prevents myristate from becoming elongated 
by further enzymatic reaction to form another 
fatty acid, palmitate, thereby preserving the 
available pool of myristate. 

O-Acetyl ADP-ribose is broken down by 
several enzymes’, and it remains to be seen 
which of them regulate the levels of other 
O-acyl ADP-ribose products or transfer them 
to other carriers, and whether these activi- 
ties affect or even drive upstream pathways 
involved in generating these metabolites. 
For now, the discovery of SIRT6-mediated 
demyristoylation opens an exciting chapter in 
the story of the seven human sirtuins and their 
biological activity. m 
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Patterns and mechanisms of early 


Pliocene warmth 


A. V. Fedorov", C. M. Brierley'**, K. T. Lawrence**, Z. Liu*, P. S. Dekens® & A. C. Ravelo® 


About five to four million years ago, in the early Pliocene epoch, Earth had a warm, temperate climate. The gradual 
cooling that followed led to the establishment of modern temperature patterns, possibly in response to a decrease in 
atmospheric CO, concentration, of the order of 100 parts per million, towards preindustrial values. Here we synthesize 
the available geochemical proxy records of sea surface temperature and show that, compared with that of today, the 
early Pliocene climate had substantially lower meridional and zonal temperature gradients but similar maximum ocean 
temperatures. Using an Earth system model, we show that none of the mechanisms currently proposed to explain 
Pliocene warmth can simultaneously reproduce all three crucial features. We suggest that a combination of several 
dynamical feedbacks underestimated in the models at present, such as those related to ocean mixing and cloud 
albedo, may have been responsible for these climate conditions. 


Earth’s climate evolution over the last five million years (Myr), since 
the early Pliocene epoch, has been meticulously studied by scientists 
(see, for example, refs 1-3). For years, the focus of attention was on the 
origin* * of glacial cycles, that is, the coming and going of continental 
ice sheets evident in the 5’*O record reflecting global ice volume and 
deep-ocean temperature (Fig. 1a). However, a wealth of new data now 
show that the gradual onset (around 3 Myr ago) and further amp- 
lification of Northern Hemisphere glaciation (Fig. 1a, b) was only one 
facet of this climate change and was a consequence of the global 
cooling rather than its initial cause”*. 

The early Pliocene itself, the warm interval that preceded the gla- 
ciation, has attracted a lot of attention as a possible analogue for future 
climate conditions. Despite relatively small differences in climate 
control factors, including CO, concentrations (Fig. 1c), between the 
early Pliocene and the present, the former was markedly different’. 
Palaeorecords indicate vast changes in climate patterns since 5-4 Myr 
ago, including a contraction of the tropical belt and oceanic warm pool’, 
emergence of strong temperature gradients along the Equator"’, cooling 
of coastal upwelling zones in the subtropics’, the shoaling of the ocean 
thermocline’’”’ and cooling of the high-latitude and deep ocean'*”’. 
Together these observations imply a large structural change in climate, 
with major global and regional implications. 

By a structural change, we mean a transition from a climate with 
almost no zonal sea surface temperature (SST) gradients (~1 °C or 
less) and a weak meridional SST gradient to a modern climate with 
much more pronounced spatial temperature contrasts. This structural 
climate change was paralleled by relatively small changes in atmo- 
spheric CO, concentration (Fig. 1c), on the order of 50-100 p.p.m. 
Thus, a key step towards understanding early Pliocene climate, and 
possibly other warm climates throughout Earth history, is to explain 
what could cause such changes. 

The objectives of this article are twofold. First we review recent 
geochemically derived palaeorecords emphasizing the three criteria 
that any hypothesis must satisfy to account for the climate of the early 
Pliocene (reduced meridional temperature gradient, weak zonal gra- 
dients and SST stability). Then we evaluate various explanations for 


early Pliocene climate with respect to these criteria, using a single 
modelling framework. 


Observations 


Substantial effort has been put into deriving palaeoclimate records for 
the Pliocene because it is recognized as the most recent example of 
prolonged global warmth in the geological past*’®. As part of the 
PRISM project, this effort produced a series of global temperature 
reconstructions’””* for a ‘time slab’ of the mid Pliocene (3-3.3 Myr 
ago). These data sets and associated modelling studies enable a coor- 
dinated comparison between the Pliocene and the present, and dra- 
matically increased our understanding of Pliocene climate. 

In the present study, we compile available SST proxy records, most 
of them continuous, to describe the long-term climate trends from the 
warm early Pliocene to the present (Figs 2 and 3, and Methods). As 
several of these records indicate, the period from 4.4 to 4 Myr ago was 
probably the warmest interval within this timeframe—the Pliocene 
climatic optimum (Fig. 2b, c). The subsequent cooling became evident 
roughly at the same time in both hemispheres (Fig. 2c, d) and involved 
regions ranging from low-latitude upwelling zones to mid and high 
latitudes. The expansion of the Northern Hemisphere ice sheets 
around 2.7 Myr ago, evidenced by an increase in the magnetic sus- 
ceptibility of sediments affected by ice-rafted debris (Fig. 1b), intro- 
duced a strong interhemispheric asymmetry in the climate evolution 
over land’ but to a lesser degree over the ocean (as shown by the 
similarity of trends in Fig. 3c, d). 

Within the global cooling pattern, local trends featured significant 
spatial and temporal variations not necessarily tracking the growth of 
continental ice sheets inferred from 5'*O data (Fig. 1a). Regionally, 
temperature changes were as high as 11 °C over the ocean” (Fig. 3c) 
and 19 °C over the land’’. Below, we focus on the critical features that 
characterize the evolution of climate structure since the early Plio- 
cene. As is traditionally done by the palaeoclimatology community, 
we compare the mean trends incorporating both glacial and intergla- 
cial intervals of the time series, even though some of our statements 
contrast the late Quaternary interglacials and Pliocene interglacials. A 
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Figure 1 | Climate evolution over the past 5 Myr. 


a, Variations in benthic 5'8O (PeeDee Belemnite 
[— (PDB) standard; Methods) from deep-ocean 
sediments, showing glacial cycles (light blue) anda 
[— long-term trend” (dark blue). 5'80 records past 
variations in deep-ocean temperature and global 
ir 2 ice volume. The shading highlights the period 
during which Northern Hemisphere glaciation 
[— gradually begins’. b, Magnetic susceptibility of 
ocean sediments in the northern Pacific”' (green) 
= and Atlantic” (red). The units are nondimensional 
and the data are presented as originally published. 
[— Higher susceptibility indicates the presence of 


more ice-rafted debris, corresponding to larger 
Northern Hemisphere ice sheets. ODP, Ocean 
Drilling Program. c, Proxy pco, data from marine 
geochemical studies and ice-core inclusions (Pco, 
partial pressure of CO2; Supplementary Table 1). 
Uncertainties, indicated by the widths of the data 
= bands, are defined in the respective studies (see 
Methods for extra references). The two dashed 


lines indicate the preindustrial (280 p.p.m.) and 
current (390 p.p.m. in 2010) concentrations of 
atmospheric CO3. 
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further comparison between the mean and ‘interglacial’ trends sup- 
ports our main conclusions (Methods and Supplementary Figs 1-3). 


Warm pool temperatures and CO» 

The first notable aspect of the Pliocene—Pleistocene climate evolution 
is the stability of warmest SSTs in the tropical warm pool over the past 
5 Myr. In all three tropical oceans, these temperatures stayed fairly 
constant and the long-term average remained ~29 °C (Figs 2a and 3a), 
similar to the present. This is especially surprising given temperature 
changes elsewhere. It has been suggested that Mg/Ca-based measure- 
ments, providing many warm pool records, should be corrected for 
secular changes in seawater composition”'. However, general agreement 
between Mg/Ca and alkenone data in locations where both proxies are 
available (ref. 12 and Methods) indicates that this correction should be 
small, comparable to calibration errors’’. 

Although a number of thermostat mechanisms capable of main- 
taining the stability of warm pool SST in high-CO, climates have been 
proposed”, we find that they may not be necessary to explain the 
Pliocene climate. A pervasive increase in atmospheric CO, is the 
driver for present anthropogenic climate change, and is expected to 
cause even larger changes in the future'®. Similarly, CO, remains a 
suspected cause for early Pliocene warmth, and several biogeoche- 
mical methods have been devised to estimate its concentration (Fig. 1c). 
Despite large uncertainties, together these proxy data suggest that 
Pliocene CO, concentrations were only 50-100 p.p.m. higher than 
preindustrial values (~280 p.p.m.). Climate models produce a ~1 °C 
temperature rise in the warm pool when the CO, concentration 
increases by 100p.p.m."*, and a change by ~2°C in doubling-of- 
CO, experiments”. Thus, within data uncertainty (Methods), the 
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stability of the warm pool temperatures is not inconsistent with the 
relatively small CO, change. 


Increasing meridional temperature gradients 
High-latitude warmth and a reduced Equator-to-pole temperature 
gradient are other dominant features of early Pliocene climate**. 
Marine data, available between latitudes 43°S and 58° N (Fig. 2e), 
suggest subsequent ocean cooling of 4-8 °C in the mid and high 
latitudes of the Atlantic and Pacific (Figs 2d and 3d). The temperature 
of Atlantic deep waters (ODP site 607; 3,400-m depth) basically fol- 
lows SST evolution in the North Atlantic, implying a similar cooling 
over the ocean convection regions at higher latitudes (~70° N). In ter- 
restrial regions of the Arctic, temperature fell by nearly 19 °C (ref. 19). 
The meridional SST gradient as measured from the Equator to the 
subtropics was also significantly smaller in the early Pliocene than at 
present’? (Fig. 4a—c), and the meridional temperature distribution 
within the Tropics was more uniform’®. Consequently, despite little 
difference in the warmest SSTs (Figs 2a and 3a), the meridional extent 
of the warm pool was much broader in the early Pliocene. The sub- 
sequent cooling led to a gradual contraction of the warm pool towards 
the Equator, as evidenced by the increase in meridional temperature 
gradient. This contraction is also apparent from temperature records at 
the edge of the warm pool in the South China Sea**”° (Supplementary 
Fig. 9). Whereas data from inside the warm pool show fairly constant 
temperatures, these particular records show a clear cooling trend. 
The reduced meridional SST gradient seems to be important for under- 
standing the high global mean temperature and other characteristics of 
the early Pliocene®*"®. For instance, from the atmospheric perspective, as 
the temperature contrast from the Equator to the subtropics increased, the 
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surface high-pressure zones in low latitudes (subtropical highs) stren- 
gthened, whereas the Hadley cells (atmospheric meridional circulation) 
intensified and contracted slightly towards the Equator. The strengthen- 
ing of atmospheric circulation led to a stronger subsidence, resulting in 
aridification of parts of Africa, Australia and North America*”’. 
From the oceanic perspective, as the meridional SST gradient 
increased so did upper-ocean stratification in the tropics, because 
oceanic vertical thermal structure is directly related to the meridional 
distribution of surface temperature**”’. This increase in stratification 
should have contributed to the basin-wide shoaling of the tropical 
thermocline®’**’, which has been under way since 5 Myr ago'"*°?!, 


Strengthening of cold upwelling 
As the meridional SST gradient increased and the tropical thermo- 
cline shoaled, cold waters became present at low latitudes, with dra- 
matic consequences for the tropics. Initially the cold waters appeared 
in the coastal subtropical upwelling sites in the Atlantic and Pacific 
oceans, in both hemispheres (Fig. 2c and 3c). The strengthening of 
anticyclonic winds within the subtropical highs may have contributed 
to the stronger upwelling®”®. 

The shoaling of the thermocline culminated in the appearance of 
colder water along the Equator (Figs 2b and 3b) and the formation of 


a salient feature of the present-day SST pattern, namely the equatorial 
‘cold tongues’ on the eastern sides of the Pacific and Atlantic basins. 
The corresponding development of the zonal temperature gradients 
along the Equator (Fig. 4d-f) and the ensuing Bjerknes feedback 
(a stronger SST gradient leads to stronger winds and, thence, a stronger 
gradient) led to the intensification of the zonal atmospheric circula- 
tion—the Walker cell*"°. 

In the present climate, an intermittent weakening of both the winds 
and the east-west equatorial temperature gradient occurs during El 
Nino, but the mean climate is still characterized by pronounced cold 
tongues in the Pacific and the Atlantic. These tongues are maintained 
by a persistent supply of colder water from the extratropics by shallow 
overturning cells**. The cold tongues and subtropical coastal up- 
welling zones are the main low-latitude regions where the ocean gains 
heat to balance that lost in high latitudes’”’*. Apparently, during the 
early Pliocene the cold water was mainly absent in the eastern equa- 
torial Pacific, and the zonal SST gradient along the Equator was small 
(Fig. 4d-f). This state is sometimes referred to as ‘permanent El Nino- 
like’ (although this implies nothing about interannual variability). A 
similar situation occurred in the Atlantic (Fig. 2a, b). 

The zonal SST gradient in the two oceans reached its modern values 
between 2 and 1 Myr ago (Fig. 4). This was a gradual process rather than 
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Figure 3 | Temperature changes in different 


regions of the ocean. Relative changes in 
temperature with respect to the beginning of each 
trend line. Solid and dashed lines are as in Fig. 2. 
Note the stability of warm pool temperatures 

(a) with respect to SST changes in other regions 
of the ocean (b-d). The strongest temperature 
change is seen in some of the coastal upwelling 
regions. For changes computed only for interglacial 
temperatures, see Supplementary Fig. 2. 
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a swift transition. The magnitude of cooling of the eastern equatorial 
Pacific (4-5 °C over 4 Myr) appears to be just slightly lower than that in 
mid latitudes (Fig. 3b, d). 

A zonal SST gradient also developed in the Indian Ocean, where 
seasonal coastal upwelling became prominent in the Arabian Sea 
(Figs 2b and 3b). The cooling in this region since the early Pliocene 
may have contributed to a gradual strengthening of the Asian mon- 
soon through coupling between the winds and SST's (stronger winds 
lead to a greater SST contrast and, in turn, yet stronger winds). In an 
atmospheric general circulation model (GCM), transitioning from a 
uniformly warm Indian Ocean to a cooler Arabian Sea leads to the 
strengthening of the monsoon and the aridification of East Africa®. 


Mechanisms to explain Pliocene climate 

Simulations with atmospheric GCMs forced with different Pliocene 
SST reconstructions suggest that the global mean temperature was 
2-3 °C higher** in the mid Pliocene (~3 Myr ago) and roughly 4°C 
higher® in the early Pliocene (~4 Myr ago) than the preindustrial 
temperature. Many studies have focused on the related problem of 
how the Northern Hemisphere stayed almost free of continental ice 
before the onset of the ice ages****”*. Several hypotheses were pro- 
posed, ranging from high CO, to enhanced ocean poleward heat 
transport*** caused by changes in ocean gateways*** and to changes 
in atmospheric processes and feedbacks. 

However, the Pliocene problem” goes much farther than the issues 
of mean global temperatures or continental ice sheets. Accordingly, 
here we conduct experiments with a state-of-the-art coupled model 
(Methods) to explore whether these prevailing hypotheses are con- 
sistent with the three observational constraints on changes in tropical 
climate between the present and the early Pliocene, that is, the reduc- 
tion of meridional temperature gradient by 4-6°C between the 
Equator and, respectively, 30° N and 30° S; a similar 4-6 °C reduction 
of zonal SST gradient in the equatorial Pacific; and little SST increase 
(0-1 °C) in the warm pool. 


Higher atmospheric CO, 

In the first experiment, we increase CO, by 100 p.p.m. starting from 
the preindustrial climate. As expected, the climate response to this 
increase is amplified at high latitudes, leading to a reduction of the 
Equator-to-pole temperature gradient. Nevertheless, this reduction 
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between the Equator and mid latitudes reaches only 0.5 °C (Fig. 5a). 
The eastern equatorial Pacific warms by the same amount, but so does 
the warm pool. Asa result, the zonal gradients hardly change, whereas 
the tropical thermocline deepens very slightly (Fig. 5f). Typically, in 
other models*’, even for CO, doubling, changes in this gradient do not 
exceed +0.5 °C, suggesting that CO, alone cannot explain the early 
Pliocene SST patterns. 

A comprehensive simulation’’ of the mid Pliocene forced by CO, 
concentrations of 400 p.p.m. and PRISM3 land surface boundary 
conditions shows an increase in global mean temperature of around 
3 °C, of which half comes from the CO, increase and the rest from 
lower orography and reduced land albedo. The model simulates a 
warming by 3-4°C in some regions of the high-latitude Atlantic, 
partly attributed to the effect of a smaller Greenland ice sheet’***”*; 
however, SST gradients at low latitudes change little while tempera- 
ture in the warm pool rises by 2-3 °C. This confirms that by itself the 
relatively minor increase in CO, (50-100 p.p.m.) could not have 
induced the observed structural climate change, even with high 
Earth-system climate sensitivity’’”*. 


Ocean gateways 

Enhanced ocean poleward heat transport during the Pliocene has 
been proposed as another factor that contributes to keeping high 
latitudes warm, especially in the northern Atlantic***. However, 
climate models calculate changes in ocean heat transport interactively 
and so require particular forcing mechanisms, such as an open Cen- 
tral American seaway*’”’ (CAS). 

Originally, the CAS closure was discussed in the context of the 
onset of glaciation around 2.7 Myr ago”’, but the closure is now esti- 
mated to have occurred between 4.7 and 4.2 Myr ago, as inferred from 
the divergence of planktonic 5'*O between the Caribbean Sea and the 
Pacific’’!°. Recently, it has been suggested that the seaway closure, 
and the resulting intensification of deep-ocean circulation due to 
increased salinity in the Atlantic, could have led to a shoaling of the 
thermocline in the Pacific, facilitating the subsequent cooling”. 

Opening the CAS in our model to a depth of 150 m (ref. 5) causes 
a reduction in the strength of the Atlantic meridional overturning 
circulation, a cooling of the Northern Hemisphere and a warming 
of the Southern Hemisphere (Fig. 5c, h and Supplementary Fig. 11). 
However, impacts on the equatorial SSTs are minor. The equatorial 


©2013 Macmillan Publishers Limited. All rights reserved 


Meridional Zonal 


T T T T 
10 806 Mg/Ca-847 alkenones 


T T T T T T T 
California margin to eastern equatorial Pacific 


SST gradient (°C) ODP 846-1012 & 
SST gradient (°C) ODP 806-847 G 


L L L L L L L 
0 05 10 15 20 25 30 3.5 4.0 


T i T T t i t 
Eastern equatorial Atlantic to North Atlantic 


12 
~. 
8 
6 
4 
u 


! ! L ! ! L 
0 05 10 15 20 25 3.0 35 4.0 


SST gradient (°C) ODP 662-607 & 
3 
SST gradient (°C) ODP 806-847 


c f 
fo} 22 T T t T T T t 
soy Eastern equatorial Atlantic to South Atlantic o 
= 20 (cs) 
4 : 
© 18 ico) 
o a 
a 
6 16 8 
o) — o 
£14 a 
i = 
o G 
2 42 & 
g g 
io) io) 
- 10 a 
ip) n 
YD 8 L L L L L L L ao f f f f 
0 05 10 15 20 25 30 3.5 4.0 fe) 1.0 20 3.0 4.0 5.0 
Age (Myr) Age (Myr) 


Figure 4 | Evolution of SST gradients since 4-5 Myr ago. a-c, Meridional 
SST gradients (in the Pacific and Atlantic) from alkenone SST proxies. 

d-f, Zonal SST gradient along the Equator in the Pacific from different ODP 
sites and temperature proxies. Despite inherent uncertainties, the different data 
support the notion of the reduced zonal and meridional SST gradients in the 
early Pliocene. 


thermocline deepens a little in the central Pacific but shoals in the very 
east. Opening the CAS to the largest possible depth (1,100 m) barely 
changes the zonal SST gradient along the Equator, but amplifies the 
interhemispheric seesaw pattern with a strong cooling in the northern 
high latitudes (Supplementary Fig. 11), contradicting the observa- 
tions. Similar behaviour is seen in other models (ref. 41; Supplemen- 
tary Fig. 12). 

Another tectonic factor potentially important for Pliocene climate 
is the northward drift of New Guinea, which may have altered the 
source of waters entering the Indonesian throughflow and feeding 
the Equatorial undercurrent and upwelling’. However, in agreement 
with previous modelling studies*, our results show impacts barely 
above statistical significance (Fig. 5b, g) and do not explain the Plio- 
cene climate state. 


Effects of ocean mixing and cloud albedo 
Recently, dynamical feedbacks not included or possibly underesti- 
mated in climate models have also been suggested to explain early 
Pliocene warmth****. In particular, it has been argued that tropical 
cyclones during that time were more widespread, generating strong 
ocean vertical mixing in the subtropical bands. Such enhanced mixing 
would reduce oceanic heat transport from the equatorial region to the 
subtropics, deepening the tropical thermocline and warming the cold 
tongue. Presumably, the contraction of the warm pool and associated 
strengthening of vertical wind shear decreased tropical cyclone acti- 
vity, contributing to the cooling trends”. 

Our experiments show that a plausible increase in the subtropical 
mixing (Supplementary Fig. 13b) causes a reduction of the zonal SST 
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Figure 5 | Testing mechanisms of Pliocene warmth and reduced 
temperature gradients. Anomalies in SST (a-e) and subsurface temperature 
(f-j) between the simulated early Pliocene and the preindustrial control. All 
simulations use a coupled climate model (the NCAR Community Earth System 
Model) and are identical except for a particular model perturbation: increasing 
CO, concentrations by 100 p.p.m. from the preindustrial level (a, f); shifting 
islands of Indonesia southward (b, g); opening the CAS to a depth of 150m 
(c, h); increasing ocean mixing in the subtropical bands (d, i); and reducing 
extratropical cloud albedo (e, j). Landmasses blocked out in green were 
removed in the respective experiments. Thick dashed and solid lines in the 
right panels show the thermocline depth in the Control and perturbation 
experiments, respectively. 


gradient in the Pacific by ~2 °C, which is promising (Fig. 5d, i). This is 
the only experiment that causes a broad deepening of the thermocline 
along the Equator (Fig. 5i). However, the impacts also include an 
increase by almost 1 °C in the meridional SST gradient (associated with 
ocean cooling in hurricane wakes), rather than the required reduction. 

It has also been suggested that extratropical low stratus clouds 
were less reflective in the Pliocene“, possibly as a result of a different 
aerosol composition or a stronger cloud response to surface warming 
than coupled models typically predict. Our sensitivity experiment with 
a moderately reduced low-cloud albedo (Supplementary Fig. 13a) 
shows a reduction by 1-2 °C in the meridional SST gradient, a weak 
reduction in the zonal gradient and a modest deepening of the tropical 
thermocline (Fig. 5e, j). However, somewhat like the CO, experiment, 
this simulation increased warm pool SSTs while producing little 
change in the thermocline depth. 
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Quantifying structural climate change 

We can assess how well these experiments capture a key characteristic 
of the Pliocene climate, that is, a more uniform SST field at low 
latitudes. A quantitative measure that reflects the structural aspects 
of climate change is the change in the spatial variance of time- 
averaged temperatures evaluated for particular regions of the ocean 
(Supplementary Table 4). Weakened zonal and meridional temper- 
ature gradients imply less spatial diversity in temperature, and so the 
early Pliocene should have a smaller variance at low latitudes as com- 
pared with the present. None of the aforementioned simulations 
reduces the tropical variance computed within the band between 
30°N and 30°S by more than 5%, whereas some actually increase 
it. When computed for the equatorial band (10° N to 10° S), the vari- 
ance is 40% lower in the ocean-mixing experiment but less than 15% 
lower in the others. 


Discussion and outlook 


Palaeorecords allow us to identify critical features of the early Pliocene 
climate: it was a warm climate characterized by a minimal increase in 
warm pool SST but substantially weaker meridional and zonal SST 
gradients, and, hence, weaker atmospheric circulation**”®*, and a 
deeper tropical thermocline. These differences between the early 
Pliocene and modern climates amount to structural climate change, 
regardless of whether we consider trends in mean temperatures or 
interglacial temperatures (the latter are less affected by continental ice 
sheets than are the former). 


7 
90° W o° 


Latitude 
s 


LEE SALE DLL SL LE DL LL LL SL GL GL 
90° E 180° 90° W o° 


Longitude 


0 1 2 3 4 5: 
Change in surface air temperature (°C) 


Figure 6 | Global surface air temperature anomalies in two experiments. 
a, CO) increase; b, Combined. The Combined experiment includes three 
different forcings: a 100-p.p.m. CO; increase, enhanced ocean mixing in the 
subtropical bands and reduced albedo of low clouds in the extratropics. Note 
the warming of the eastern equatorial Pacific, a reduction of the temperature 
gradient between the Equator and the subtropics (also see Supplementary 
Fig. 11), and a substantial warming of high latitudes. 
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Overall, we are just beginning to understand the driving forces 
behind climate evolution since the early Pliocene. The climatic changes 
occurred in concert with a relatively small reduction—less than 50- 
100 p.p.m.—in atmospheric CO, (Fig. 1c). Either this reduction pushed 
the system over a threshold, leading to the structural climate change, or 
the climate system was cooled by other mechanisms and CO) provided 
a positive feedback for the cooling. 

None of the currently proposed mechanisms can reproduce all of 
the key Pliocene climate features by itself. Some mechanisms address 
one aspect of the structural change, but often at the detriment of the 
other features; a major problem is to simulate reduced temperature 
gradients without raising warm pool SSTs. Increases in greenhouse 
gases alone are not sufficient, whereas shifts in oceanic gateways have 
much smaller effects than sometimes suggested. The most promising 
dynamical mechanisms involve lower extratropical cloud albedo or 
enhanced ocean mixing. For these mechanisms to explain the stark 
differences between the Pliocene and today’s climates, the corres- 
ponding model has to be modified. 

Imposing the two dynamical mechanisms in a climate model while 
increasing CO, (Fig. 6b) produces a warming by ~3 °C in the eastern 
equatorial Pacific and a reduction by 2-4 °C in the meridional tempera- 
ture gradient, leading to an increase of only 1 °C in the temperature 
of the warm pool. The spatial variance of SST's decreases by 60% in the 
equatorial region (Supplementary Table 4), in stark contrast with the 
CO,-only experiment (Fig. 6a). Thus, a combination of several factors 
may be necessary to explain Pliocene climate. 

An important question is to what degree the evidence of climate 
evolution over the last 5 Myr shapes our assessment of future change. 
From the observations, it is clear that the climate system is capable of 
remarkable transformations in response even to small changes in 
external parameters (CO, concentrations, continental boundaries 
and orography are generally similar for the early Pliocene and pre- 
sent). Therefore, explaining the discrepancy between model simula- 
tions and the early Pliocene temperature patterns is essential for 
building confidence in our climate projections. Finally, it might be 
necessary to contemplate, in addition to global mean temperature 
increase, the possibility of structural climate change (relaxation of 
prevailing temperature gradients at low latitudes). 


METHODS SUMMARY 


All the geochemical records used to compute the trends presented in this article 
have been previously published. The ocean temperatures come from two geo- 
chemical proxies (alkenone unsaturation index (U*' 57) and Mg/Ca ratios of 
planktonic foraminiferal shells) that are derived from material preserved in 
deep-sea sediment (Methods). These proxies are recorded by microorganisms 
living in the surface mixed layer of the ocean (except one record that comes from 
benthic foraminifers), where their chemical composition reflects the conditions 
of the surrounding water. The leading-order trends are computed as 400-kyr or 
600-kyr running means for each temperature record, which filters out glacial- 
interglacial variability. The original data are included in Supplementary Infor- 
mation. The CO, estimates were calculated using a variety of techniques 
(Supplementary Table 1) and are presented as originally published except the 
record based on 51°C (ref. 46), which was adjusted to subtract an offset (90 p.p.m.) 
between the original data and the Antarctic ice-core CO) records. 

The climate model used is the NCAR Community Earth System Model”. This 
incorporates an atmospheric model and a land surface model (CAM4 and CLM, 
respectively) with a spectral truncation of T31 (a horizontal grid of 3.75° by 3.75°) 
coupled to ocean and sea-ice components (POP2 and CICE) with a nominal 3° 
resolution that increases to 1° near the Equator. Each numerical experiment is 
designed to examine the effects of a single type of perturbation with respect to 
a preindustrial baseline simulation (a change in a particular ocean gateway or 
a change in cloud properties, for example). In the final experiment, titled 
‘Combined’, we imposed three different perturbations together (enhanced ocean 
mixing in the subtropical bands, reduced albedo of low extratropical clouds and a 
100-p.p.m. CO, increase). For each experiment, the simulation time was 500 yr, 
with the final 25 yr being used for analysis. Further details are provided in 
Methods, including a discussion of mechanisms related to ocean salinity and 
atmospheric superrotation”. 
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Full Methods and any associated references are available in the online version of 
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METHODS 


SST reconstructions. Two geochemical proxies, alkenone unsaturation index 
(U*',,) and Mg/Ca ratios of planktonic foraminiferal shells, derived from mater- 
ial preserved in deep-sea sediment, are now widely used to reconstruct ocean 
temperatures. Alkenones are long-chain ethyl and methyl] ketones that vary in the 
number of double bonds they have and are produced by coccolithophorids living 
in ocean surface waters. The degree of unsaturation in C37 alkenones reflects the 
temperature of the water in which these organisms lived at the time of production. 
Owing to their large size relative to other organic compounds, these molecules are 
resistant to degradation and are therefore well preserved in the sediment record. 
Several global calibration studies indicate a robust linear calibration between 
U*'3, and ocean surface temperature, confirming the global applicability of a 
single calibration equation (see ref. 53 for a review). All U5, values reported 
here are use the calibration from ref. 54 except for ODP site 882, which, owing to a 
significant abundance of C37.4, uses the U*,, index as initially reported’. Because 
the dominant producers of UX’ 3 in the modern ocean have been present only in 
the past 200 kyr, there was initially concern about the applicability of this proxy 
on longer timescales. However, numerous studies have shown that changes in 
U*'37 are not coincident with shifts in haptophyte algae species***”, indicating 
that producers of alkenones consistently record surface water conditions. 

Planktonic foraminifera secrete their CaCO; shell in the surface mixed layer, 
where warmer temperatures cause an exponential increase in the Mg/Ca ratio 
because more Mg substitutes for Ca in the shell under warmer conditions. 
Although shells are commonly well preserved in sediments above the lysocline, 
partial dissolution of the shells lowers the Mg content, potentially biasing SST 
estimates. This bias can be avoided by selecting well-preserved shell material or by 
using calibrations that include a correction for dissolution. The calibration used 
for any given site must be considered individually, as the preservation of the 
sample material will determine which calibration is most appropriate. We exam- 
ined the appropriateness of the calibrations used at each site in the original 
publications, and assessed the sensitivity of the resultant SST estimates to using 
or not using a dissolution correction where one had not been applied in the 
original study. In all cases, the differences in calibrations yielded estimates that 
differed from each other by less than 1 °C. In most cases, the results differed by 
less than 0.5 °C, suggesting that SST estimates for the sites examined here are not 
strongly influenced by dissolution. 

Four different calibrations were applied to different records in this study 
(Supplementary Table 3). The main differences between the calibrations are 
the presence or absence of a dissolution correction and how that dissolution 
correction is incorporated into the calibration. It is important to note that the 
calibrations are quite similar if preservation is good. For example, some calibra- 
tions use ACO3” or core depth to correct for dissolution®, but if preservation is 
excellent, and that correction is removed, the calibration is essentially the same 
as that of ref. 59, which does not include a dissolution correction. Therefore, 
calibration choice affects the absolute SST estimates but does not change the 
long-term trends of the records. Given that changing calibrations from those 
selected in the original publications yields very small differences in the absolute 
SST estimates and does not affect the long-term trends, we chose to use the 
originally applied calibrations at all sites except one. The Mg/Ca data from 
ODP site 709 overlap with data from ODP site 806, which is not unexpected given 
that both sites are in the warm pool and at similar water depths. However, the 
SSTs reported in ref. 60 are ~1.5°C cooler than those from ODP site 806. This 
could be because the dissolution correction applied to the data from ODP site 709 
is based on core tops from the Caribbean Sea®’, which may not be appropriate for 
an Indian Ocean site, such as site 709. We chose to apply the same ACO;7 - 
corrected calibration to ODP site 709 as was used for ODP site 806 because doing 
so yields a core-top estimate close to modern SST values in the region for ODP 
site 806 (ODP site 709 does not have core-top data). 

It has been suggested that Mg/Ca temperature measurements, which provide 
the majority of the warm pool data, should be corrected for secular changes in 
seawater composition’'. However, the general agreement between Mg/Ca and 
alkenone records in locations where both techniques are applicable indicates that 
this correction should be small, of the order of calibration errors!”, and it is 
therefore not applied here. 

Finally, we have chosen to omit Mg/Ca records from sites 999 and 1000 located 
in the Caribbean Sea, because these records do not clearly reflect open-ocean 
conditions’. We have also omitted site 1241 from our analysis, because this site 
has moved considerably because of tectonic processes (the SST at its present 
location is ~1.4°C higher than that at its Pliocene latitude). Additionally, the 
Mg/Ca calibration used in the original publication” is based on a culturing study 
that excludes a dissolution correction, which we feel should be incorporated at 
this location. 


The trends are computed as 400-kyr or 600-kyr running means for each tem- 
perature record, which were resampled to a 2-kyr spacing before computing the 
means (the original data are included in Supplementary Information). This 
approach filters out glacial-interglacial variability and highlights first-order 
trends in these records. Comparative analysis of different resampling resolutions 
and window sizes demonstrates that the first-order results reported here are 
insensitive to changes in these parameters. 

Computing the trends as we do in the main part of the paper, incorporating 
both glacial and interglacial intervals of the time series, is conventional in palaeo- 
climatology. Nevertheless, some of the inferences in this study actually concern 
differences between the late Quaternary interglacials and Pliocene interglacials. 
Therefore, to make sure that our conclusions about the structural climate change 
are robust regardless of how we compute the trends, we have compared the mean 
trends and those based just on interglacial intervals of the time series. 

The procedure for calculating ‘interglacial’ trends includes several steps. First, 
each time series is divided into separate intervals (bins). Typically, we choose bins 
of 41-kyr length, which corresponds to the dominant spectral component of the 
glacial cycles of the past 5 Myr (the one related to variations in Earth’s obliquity). 
Longer intervals are chosen for coarse time series. For each bin, we find the point 
with the maximum temperature and then interpolate these temperatures on the 
original grid of the time series. Finally, to compute the new trends, we apply 
400-kyr or 600-kyr running means to the new record. This procedure picks up 
maximum interglacial temperatures as well as points with relatively warm tem- 
peratures but lying outside the actual interglacials (specifically, those belonging to 
the warm interstadials of the past 1 Myr). We choose to retain such points because 
many of them are as warm as modern temperatures and also because we prefer to 
use bins of the same size throughout the entire length of the time series. Very 
rarely does this procedure pick up cold points erroneously, but the effect of such 
errors is minimized by computing the running means. Using larger bins, 80 kyr 
for example, would eliminate most of these points and raise the trend lines by a 
few tenths of a degree Celsius on average. 

As expected, the mean temperatures computed for interglacials (Supplemen- 
tary Fig. 1) turn out to be ~1°C warmer than those computed for the full time 
series, but the relative temperature changes (Supplementary Fig. 2) are remark- 
ably similar between the two methods, with differences typically below 0.5 °C, 
which confirms the robustness of our main conclusions. 

The SST data are grouped into four broad categories: waters of low- 

latitude warm pools'’**°**°° (Supplementary Fig. 4); upwelling zones in the deep 
tropics'’*°*”°, including equatorial upwelling sites and the Arabian Sea (Sup- 
plementary Fig. 5); subtropical coastal upwelling regions'”*°°””"” (Supplemen- 
tary Fig. 7); mid and high latitudes'*!°°***’*”*"* (Supplementary Fig. 6). There 
are also two sites in the South China Sea that currently lie outside the warm 
pool?*?*”’, but they seem to have been contained within it earlier (Supplementary 
Figs 8 and 9). The high productivity at upwelling locations makes them especially 
suitable for the creation of continuous records over the past 5 Myr, hence their 
prominence in this compilation. See Supplementary Information for both the full 
and the temporally averaged data. 
Data uncertainty and error bars. The two SST proxies we use (Mg/Ca and Ux 37) 
are widely applied in palaeoceanographic studies, but each carries inherent 
uncertainties. Although some uncertainties, such as ecological shifts for the 
U*’,, values and changes in seawater chemistry for Mg/Ca values, are difficult 
to quantify, several studies have now demonstrated that these are probably minor 
and that both proxies are reliable on Pliocene-Pleistocene timescales'***. Also, 
although the U*’ 3, index is insensitive to temperature change above ~29 °C, only 
three of the records included here reach this threshold value, and do so only for 
short durations. Comparison between these records and other SST proxies from 
the same sites indicates that temperatures during theses times were probably only 
marginally warmer than the threshold value, which should not affect the conclu- 
sions reached in this paper. Because analytical precision is excellent for both types 
of measurements, the dominant source of error in converting the primary data to 
SST estimates is in the calibration used. The calibration errors can be propagated 
to provide an estimate of the errors for each type of data. 

Although a detailed error analysis of each of the previously published proxy 
records included in this study is beyond the scope of this paper, we nonetheless 
compare four records from the tropical Pacific Ocean (Supplementary Fig. 10). 
The Mg/Ca and U*’ 7 records at ODP site 847 have previously been compared 
in detail’’, and here we compare those records from the eastern equatorial Pacific 
(EEP) with Mg/Ca and UX’, records from ODP site 806 in the western equatorial 
Pacific'’** (WEP). Because the data were not measured on the same samples 
within each core, and because of the different time resolutions of the data, 
we separated the data into 400-kyr bins. This bin size allows for reliable statis- 
tics but is not so large that any one bin encompasses a large portion of the long- 
term trends. 
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For both the Mg/Ca and U*’ ;,, the analytical error is essentially negligible and 
the dominant source of error is the calibration used to convert the primary data to 
a SST estimate. To assess the uncertainty in SST estimates (ossr), we propagated 
the uncertainties in the parameters of the proxy calibrations. For example, for 
UX’, the linear calibration is 


The uncertainties in a (¢,) and b (a) are reported in the original calibration, and 
can be propagated for each data point as follows: 


a af OSST\, of OSST\" 5 f OSST'\* 
Psst Fal Gq) + %\ ay) 17%, OUR, 
ae 37 


The same process was done for the Mg/Ca calibration (the same Mg/Ca cali- 
bration was used in the WEP and the EEP). The average propagated errors for 
Ue are 1.6°C, for ODP sites 847 and 806, and 1.7°C and 1.9°C, for ODP 
sites 847 and 806, respectively. The error bars in Supplementary Fig. 10 represent 
the uncertainties in the means calculated for each bin, given the propagated errors 
described above, calculated as 


Omean SST = oss /VN 


We note that the errors for U3, at ODP site 806 are larger owing to the lower- 
resolution data. 

Propagating the error in the calibration demonstrates several important points. 
First, the Mg/Ca and U*’3, records agree remarkably well in both the WEP and 
the EEP, in terms of both the long-term trend and the absolute temperature 
estimates. In the EEP, both records show a cooling by 3-4°C over the last 
4 Myr, and the error bars of the most recent three bins do not overlap with the 
errors of the oldest three bins, indicating that the EEP was significantly colder in 
the Pleistocene than during the early Pliocene climactic optimum. The situation is 
strikingly different in the WEP, where both Mg/Ca and U* 3, indicate that SST in 
the early Pliocene climatic optimum was not significantly warmer than the 
Pleistocene. Thus, the errors demonstrate that at both sites the two proxies, which 
have different sources of potential bias, agree remarkably well (Supplementary 
Fig. 10). This exercise confirms that SST estimates for the eastern and western 
tropical Pacific indeed overlap during the early Pliocene, although there is a clear 
gradient between the two locations in the Pleistocene. 

In addition, although there are uncertainties in absolute SST values for the 
Pliocene, planktonic foraminiferal assemblages and the 8180 of surface and sub- 
surface planktonic foraminifera were also similar in the western and eastern 
equatorial Pacific (refs 30,78 and references therein), providing strong supporting 
evidence for the reduced east-west gradients in oceanographic conditions during 
that time. 

Note that the isotopic oxygen ratio used in Fig. 1, where it is expressed in per 

mille, is defined as 8'°Oppg (%o) = [('8O/"°O) ampte/(“8O/"6O) ppg — 1] X 1,000 
(PDB, PeeDee Belemnite). 
Sensitivity experiments. All of the pertinent hypotheses, some tested in idealized 
or more comprehensive models before, have been re-examined within a single 
modelling framework to allow a better comparison of their relative impacts. This 
framework is based on the latest version of the NCAR Community Earth System 
Model*’ (CESM) at its relatively low resolution, which facilitates palaeoclimate 
simulations”. The model incorporates an atmospheric model and a land surface 
model with a spectral truncation of T31 (CAM4 and CLM) coupled to ocean and 
sea-ice components with a nominal 3° resolution (POP2 and CICE, respectively). 
The model has a cold bias that is especially pronounced in the Arctic”. 

The previous version of the model (CCSM3) has been used extensively for 
climate simulations that contributed, for example, to the IPCC Fourth Assess- 
ment Report’*; simulations with CESM will contribute to the next report. Further 
information about the model, including its source code and the boundary con- 
ditions are freely accessible via http://www.cesm.ucar.edu/models/cesm1.0. We 
use subversion 1.0.2, because earlier versions of the model were not scientifically 
validated at the low resolution. All the integrations start from a pre-existing 
preindustrial simulation” (available as b40.037 from http://www.cesm.ucar. 
edu/models/cesm1.0) and have been integrated for a further 500 yr (including 
the preindustrial control). This is sufficient for the upper ocean to reach equilib- 
rium with the deep ocean experiencing a weak residual drift. All simulations have 
orography, vegetation type and land-ice cover prescribed at preindustrial condi- 
tions and are designed to assess the roles of separate mechanisms rather than to 
reproduce the early Pliocene climate exactly. Previous work” shows that modi- 
fying these factors as appropriate for the Pliocene may increase global mean 
temperature by ~1°C, but contributes little to the structural climate change 
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described here. Figures 5 and 6 show the difference in the averaged climate over 
the final 25 years from this control simulation. 

Each numerical experiment examines the effects of a single type of perturba- 
tion with respect to the preindustrial baseline simulation (control), except for the 
final experiment, which combines three different perturbations. Supplementary 
Table 4 lists all the experiments and the simulated changes in global mean surface 
air temperature and temperature variances. The other details of the simulations 
are described below. 

The CO, experiment involves an instantaneous increase in atmospheric CO, 
concentration from 285 p.p.m. to 385 p.p.m., leading to an increase of 1°C in 
global mean air temperature. It is important that the base model has an equilib- 
rium climate sensitivity of 2.9 °C (ref. 80), in the middle of the typical range of 
model climate sensitivities. 

Bathymetric perturbations in the ocean gateway experiments involve the 
removal of five land points (coloured green in Fig. 5). In the shallow-CAS experi- 
ment, the bathymetry has been lowered to 150 m. In the other experiments, the 
bathymetry has been lowered to that of the model’s nearby open ocean: 580 m for 
the Indonesia experiment and 1,100 m for the deep-CAS experiment. The latter 
experiment shows the maximum possible signal of this particular tectonic change. 
Modified boundary conditions for the tectonic simulations can be provided on 
request. Variations of this experiment that have been performed previously were 
collated in ref. 41. Supplementary Fig. 12 shows how the SST changes in the 
present simulations compare with their compilation. 

In the enhanced ocean mixing experiment, the hurricane-induced mixing is 
treated as an increase in background vertical diffusivity of 1cm’s~' in two 
subtropical bands (8° N-40° N and 8° S-40° S) at depths down to the model level 
at 185 m (ref. 45; Supplementary Fig. 13b). The value of 1 cm” s-' was chosen as it 
is the annual mean diffusivity observed in regions of strong tropical cyclone 
activity®’. A recent study shows that imposing permanent mixing in the upper 
ocean has a largely similar effect as a transient mixing’. 

In the reduced-cloud-albedo experiment, the cloud liquid water path has been 
reduced by 20% polewards of 30°N and 30°S, but only for the purposes of 
calculating the shortwave radiation. This acts to reduce the albedo of highly 
reflective low stratus clouds (Supplementary Fig. 13a). As a result, global mean 
air surface temperature increases by 1.1 °C (Supplementary Table 4). One of the 
effects of this modification was an increase of 0.7 °C in the tropical West Pacific 
SST, which stopped us from imposing a greater reduction. We note that in this 
experiment we do not directly prescribe changes in cloud albedo, which is calcu- 
lated within the CESM. This is a different approach from that used to investigate 
sensitivity to cloud properties in ref. 44. Those authors used a coupled model of 
intermediate complexity with a specified cloud cover, which they reduced by 50% 
in their perturbation experiments. Here we simulated a range of reduced liquid 
water paths in the atmosphere, with a 20% reduction leading to the representation 
closest to that of ref. 44. The actual change for the early Pliocene, if any, is largely 
unconstrained. 

We note that the effects of changes in extratropical low clouds and enhanced 
ocean mixing on tropical climate seem to be robust in a variety of models. New 
experiments are now being conducted with high-resolution versions of CESM 
and other coupled models, which seem to confirm our previous conclusions (to be 
reported elsewhere). 

Among hypotheses that we have not tested, we should mention atmospheric 
superrotation”’. In such a state, common on gas giants such as Jupiter, atmo- 
spheric Rossby waves radiate polewards from the tropics, generating westerly 
winds along the Equator and suppressing equatorial upwelling. Recent studies*’, 
however, indicate that superrotation develops only at temperatures in excess of 
40 °C. It has also been argued that a stronger meridional salinity gradient (with 
more fresh water in the northern Pacific) could reduce oceanic meridional density 
gradient and collapse the tropical thermocline***’. This would require a strong 
increase in precipitation in high latitudes, for which evidence has yet to be found. 
Carbon dioxide estimates. The CO, estimates are calculated using a variety of 
techniques based on carbon isotopes”, boron isotopes***”, alkenones***’, boron/ 
calcium ratios**, stomata®’ and ice-core inclusions” (Supplementary Table 1). 
They are presented as originally published except for the record based on 8'°C 
(ref. 46), which was adjusted to subtract an offset (90 p.p.m.) between the original 
data and the Antarctic ice-core CO, records. 
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Intra-oceanic subduction shaped the 
assembly of Cordilleran North America 


Karin Sigloch' & Mitchell G. Mihalynuk? 


The western quarter of North America consists of accreted terranes—crustal blocks added over the past 200 million 
years—but the reason for this is unclear. The widely accepted explanation posits that the oceanic Farallon plate acted as a 
conveyor belt, sweeping terranes into the continental margin while subducting under it. Here we show that this 
hypothesis, which fails to explain many terrane complexities, is also inconsistent with new tomographic images of 
lower-mantle slabs, and with their locations relative to plate reconstructions. We offer a reinterpretation of North 
American palaeogeography and test it quantitatively: collision events are clearly recorded by slab geometry, and can 
be time calibrated and reconciled with plate reconstructions and surface geology. The seas west of Cretaceous North 
America must have resembled today’s western Pacific, strung with island arcs. All proto-Pacific plates initially 
subducted into almost stationary, intra-oceanic trenches, and accumulated below as massive vertical slab walls. 
Above the slabs, long-lived volcanic archipelagos and subduction complexes grew. Crustal accretion occurred when 
North America overrode the archipelagos, causing major episodes of Cordilleran mountain building. 


Continents grow through subduction magmatism and collision of arcs 
and other buoyant crustal fragments at their margins. Poorly under- 
stood, such collisions are of broad scientific interest because they cause 
rapid geographic changes, affecting climate, ocean circulation, biota 
and the formation of economically important metal deposits. North 
America was enlarged by a sequence of massive terrane collisions 
relatively recently (between 200 million years (Myr) ago and 50 Myr 
ago), which created the mountainous Cordillera of the American West!. 

Reconciling geological records on land with those of the ocean basins 
has proved difficult. Magnetic stripes on the sea floor are the basis of all 
quantitative plate tectonic reconstructions, and well-preserved Atlantic 
spreading records indicate that North America has moved westward 
continuously since the breakup of Pangaea (about 185 Myr ago), away 
from Africa and Europe’. In contrast, more than half of the seafloor 
records of proto-Pacific (Panthalassa) ocean spreading are missing. 
The Pacific plate records the existence of another major oceanic plate 
to its northeast since at least 180 Myr ago: the Farallon plate. This plate 
is usually assumed to have filled the eastern Panthalassa basin, extend- 
ing to the western margin of North America and subducting under it, 
although this is not required by the magnetic seafloor data. Hence, the 
Farallon plate, invoked as the causative agent in almost all major land 
geological events since late Jurassic times”’, should also have trans- 
ported the terranes to the continental margin. 

However, dozens of terranes have accreted to North America since 
200 Myr ago’, but not to the Andean margin of South America, sup- 
posedly a closely analogous setting. The terranes are mostly Triassic to 
Cretaceous island arc-subduction assemblages, but include three 
more heterogeneous superterranes: Intermontane (IMS)*°, Insular 
(INS)* and Guerrero (GUE)’, which are essentially microcontinents. 
Their exact origins remain mysterious, but the terranes formed at 
various latitudes and times, as inferred from palaeomagnetic observa- 
tions and fossil faunas, and pre-assembled at others. This implies the 
temporary existence of additional oceanic plates in the northeastern 
proto-Pacific Ocean’*, which are missing from quantitative plate 
reconstructions. 


The mantle retains a memory of ancient plate configurations, in the 
form of subducted slabs, which body-wave tomography images in situ 
as domains of faster-than-average seismic wave velocities. Beneath 
North America these slab relics are massive, almost vertical walls 
extending from 800 to 2,000 km in depth, and typically 400-600 km 
wide (Fig. 1, Supplementary Fig. 1). The largest wall runs from north- 
west Canada to the eastern USA and on to Central America, and has 
been called the Farallon slab, one of the most massive features in global 
tomographies”"’*. We argue that in fact most of this slab wall is not 
Farallon and subdivide it into the Angayucham (ANG), Mezcalera 
(MEZ) and the Southern Farallon (SF) components (Fig. 1b). This 
reinterpretation is based on our most recent tomography model’, 
which utilizes dense USArray data'* in addition to global network data, 
using a cutting-edge waveform inversion method: multiple-frequency 
P-wave tomography”. 

Besides putting the known eastern slab walls”"’” in sharper focus, 
we discovered’* that another, more westerly slab wall, the Casca- 
dia Root (CR in Fig. 1b, depth 700 km to more than 1,800 km), con- 
nects upward continuously to the present-day Cascadia trench, 
into which the last remnant of the Farallon (Juan de Fuca) plate 
is subducting. This makes CR a Farallon slab by definition, and 
prompted us to re-evaluate whether MEZ/ANG/SF do really consti- 
tute subducted Farallon Ocean floor, which to our knowledge has 
never been questioned. 

For plate reconstructions, the crucial question is whether (and 
how) these lower-mantle slab walls have moved laterally since they 
were deposited beneath their corresponding volcanic arcs (past versus 
present x-y positions in an absolute reference frame). Here we quan- 
titatively test the hypothesis that they have not moved appreciably, 
that is, there has been only vertical sinking within our observa- 
tional uncertainties (a few hundred kilometres laterally). Motivated 
primarily by imaged slab geometries, this null hypothesis also seems 
sensible in light of the Cenozoic subduction record, where absolute 
trench motion on average contributed only 10-30% of total plate 
convergence’®””, 
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Figure 1 | Slabs under North America and continental motion over time. 
a, Subducted slabs at and below 900 km depth. P-wave tomography model’? 
rendered as three-dimensional (3D) isosurface contours, which enclose faster- 
than-average structure (threshold dvp/vp = 0.25%, where vp is the P-wave 
velocity). Colour signifies depth and changes every 200 km; the scene is 
illuminated to convey 3D perspective. At a sinking rate of about 10mm yr’, 
this slab assemblage should have been deposited from about 200 Myr ago to 
90 Myr ago. Reconstructed continent positions at 140 Myr ago are shown in a 
hotspot reference frame”' and at 170 Myr ago in a hotspot/palaeomagnetic 


Slabs and arcs at stationary trenches 


Figure 2a and b shows how a steep, widened slab wall could be piled up 
by nearly vertical sinking beneath a long-lived, stationary trench and 
volcanic arc. An Andean-style west-coast trench could not have been 
stationary because North America moved westward as the Atlantic 
Ocean spread. This contradiction is resolved by westward intra-oceanic 
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Figure 2 | Schematic cross-section and evolution of a terrane station. t, 
denotes the time of arc-continent collision. Motions are shown in a lower- 
mantle reference frame. a, Well before the collision, both trench and arc are 
active. Slab buckling is due to the viscosity contrast around 670 km, but the 
backlog reaches into the upper mantle. b, Around ¢, and up to about 10 Myr 
later, the continent overrides the trench and accretes its arc terranes, while the 
slab breaks. c, Well after the collision, the slab wall continues to sink. Seaward, a 
new Andean-style subduction has developed. Anchored in the lower mantle, 
the slab wall is sinking vertically at a steady-state rate of approximately 
10mmyr ' in all three panels. 


hybrid frame’. The hatched area represents location uncertainty for 
continental margin during Jurassic/Cretaceous times; the cross-hatched area 
shows terranes that accreted during Cretaceous and early Tertiary times. 

b, Interpretative legend. The slab walls divide into four groups: Cascadia/ 
Northern Farallon slabs (blue) and Southern Farallon slabs (green), owing to 
eastward subduction; Angayucham (ANG, red) and Mezcalera (MEZ, orange) 
slabs, owing to westward subduction. Before 140 Myr ago, sizeable ocean basins 
separated North America from the ANG/MEZ trenches. 


subduction before the arrival of the continent, followed by a polarity 
switch of subduction to its current eastward motion into a continental- 
margin trench (Fig. 2c). Such a scenario implies that the imaged lower- 
mantle slabs MEZ/ANG/SF are Jurassic to Cretaceous in age, allowing 
the collision of North America with their subduction zones to cause the 
Cretaceous terrane accretions. 

To the extent that slabs sink vertically, they record palaeo-arc and 
trench locations in an absolute sense. Thus, vertical sinking permits 
quantitative predictions of the location and timing of continent-trench 
collisions when tomography and absolute plate reconstructions are 
combined. These predictions can be tested against the docking times 
of arc terranes inferred from land geology. Abrupt upward truncations 
of the slab walls, which are well resolved tomographically (Supplemen- 
tary Fig. 2), correspond to the shutdown of the overlying trench-arc 
systems, and hence to docking times (Fig. 2). 

If the trench remains stationary, a vertical slab pile is deposited 
beneath it. Ifthe trench moves (but every parcel of slab sinks vertically), 
the imaged slab will dip towards older trench locations, assuming no 
dramatic lateral variations in sinking rate. The observed lower-mantle 
walls are widened to 400-600 km laterally, that is, 4-6 times the thick- 
ness of oceanic lithosphere—this is not artificial blur, but the actual 
reason for their robust tomographic visibility'®. Thickening is probably 
achieved by slab folding above the 670-km viscosity jump, deviations 
from vertical sinking being due mainly to the folding process itself 
(Fig. 2). In convection models, slab folding occurs preferentially 
beneath the kind of stationary trenches postulated here'**°. Massive, 
thickened slabs like these can be expected to be the drivers of ‘mantle 
wind’, rather than blowing in it: that is, if anything sinks vertically, it 
should be these slabs. 

Such slab walls indicate that their overlying trenches remained in 
the same absolute locations for a long time, with arc and accretionary 
complex growth stationed above these locations. Observation of mas- 
sive slab walls leads us to think of their associated, intra-oceanic trenches 
as ‘terrane stations’ where new crustal material is gathered to await 
transfer to a continental margin. Terrane stations above ANG and 
MEZ were not conveyed eastward into a continental Farallon trench. 
Rather, North America migrated westward, collided, and accreted the 
ANG and MEZ terrane stations. Hence slab walls tie the now-displaced 
terranes toa laterally unchanged subsurface, constraining absolute loca- 
tions and temporal evolution of oceanic trenches more than a hundred 
million years after their demise. 
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Figure 1 shows the reconstructed western margin of North 
America”', together with schematic outlines of the lower-mantle slab 
walls from tomography’’. North America’s relative westward motion 
is well constrained by the Atlantic spreading record”, independently 
of any absolute reference frame. Comparison to the basic geometry of 
ANG and MEZ suggests that these two slabs did not subduct beneath 
the continental margin: (1) the slabs are vertical from 800 to 2,000 km 
depth, indicating a stationary trench, whereas the margin moved 
westward continuously. (2) The outlines of ANG and MEZ, especially 
the pronounced eastward promontory of MEZ, do not match the 
outlines of the continental margin (Fig. 1). If continental subduction 
had controlled slab deposition, then slab curvature should reflect the 
curvature of the continent. (3) West of ANG/MEZ, the slab is smeared 
out laterally in the upper 800 km (Supplementary Fig. 1), as might be 
expected from a trench dragged along by a migrating continent: direct 
observational evidence for a switch in subduction mode after override, 
from stationary oceanic to migrating continental. 


Quantitative prediction of arc accretion 


Continent motions in Figs 1 and 3 are tied to an absolute hotspot 
reference frame”! and rendered by the palaeo-geographic information 
system GPlates**. Like the vertically sinking slab walls, vertically 
rising plumes are thought not to have significant lateral motions rela- 
tive to the lowermost mantle*® (smaller deviations are correctable’), 
so that the hotspot and ‘slab wall’ reference frames are equivalent 
(no relative motion). In this merged reference frame, the lateral 
overlay of North America’s reconstructed western margin with a slab 
wall amounts to a spatiotemporal prediction of trench override and 
terrane accretion. 

For example, Fig. 1a shows the North American margin overriding 
point Al, eastern promontory of the MEZ arc, sometime before 
140 Myr, probably around 150 Myr ago. Slab sinking velocities can 
be inferred: the shallow end of MEZ beneath A1 is seen to have sunk to 
a depth of approximately 1,500 km, implying an averaged sinking rate 
of 10mmyr ‘. The MEZ promontory shallows to the southwest. This 
is unrelated to trench polarity (the slab did not dip northeastward), 
but rather reflects differential sinking times (subduction at Al was 
choked off earlier than at A2). Sinking rates could be estimated from 
any well-resolved point on the upward truncation of a slab wall, but 
we choose five points Al-A5 that are associated with supporting 
evidence from land geology (Table 1 and Fig. 3). Predicted override 
ages are Jurassic-Cretaceous (146-55 Myr ago), becoming progres- 
sively younger westward, and truncation depths shallow to the west as 
expected. Sinking rate estimates range between 9 and 12mmyr ' 
(+2mmyr '), consistent with findings of 12 + 3 mmyr ‘ globally”. 
Figure 3 renders the override sequence in four time slices, each show- 
ing only slabs that should already have been deposited at the time, 
assuming the sinking rate was 10 mmyr ' (the average of Al-A5 
in Table 1). 

The CR must be a Farallon slab because the Farallon (Juan de Fuca) 
plate is still subducting into it today’’. Pacific seafloor records indicate 
continuous Farallon spreading since about 180 Myr ago*”*, so that at 
a sinking rate of 10mm yr, the over-1,800-km-deep CR accounts 
for the entire lifetime of the (northern) Farallon plate. The presence 
of this CR slab implies that the equally deep and thickened ANG 
slab further east cannot represent Farallon lithosphere, as has been 
assumed”*"*78, Rather, the ANG slab must have dipped in the oppo- 
site direction (southwestward) in order to have sourced sufficient plate 
material from an ocean basin that lay to the northeast, the consump- 
tion of which accommodated the westward drift of North America. 
This scenario for transporting North America away from the former 
Pangaea provides an alternative to westward rollback of a continental 
Farallon trench. 

Hence our inferred trench/plate evolution in Fig. 3 differs fun- 
damentally from the commonly accepted scenario of MEZ/ANG as 
products of east-dipping, Farallon-beneath-continent subduction. 
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Westward subduction of the ANG and MEZ slabs, probably since 
early Jurassic times, consumed the ocean that bounded North 
America on the west (the ANG and MEZ basins in Fig. 3). Both basins 
were consumed in a zipper-like fashion: MEZ closed from north to 
south, ANG from south to north, as North America gradually over- 
rode and shut down the ANG/MEZ arcs between 150 and 50 Myr ago. 
Further west, the early Farallon Ocean subducted into two east-west- 
oriented Cascadia slabs (CR/CR2), but established additional seg- 
ments SF1/SF2 after a clockwise rotation at about 147 Myr ago”. 
Thus at northerly latitudes, two long-lived terrane stations of opposite 
polarity coexisted in ANG and CR, a variant not considered in Fig. 2. 
Upon override, the east-dipping Farallon trenches started rolling back 
with North America. Moderate complexities in the Pacific—-Farallon 
spreading record*”® probably reflect the transitions of individual 
trench segments from intra-oceanic to Andean-style. 


Supporting evidence from land geology 


We now use the terrane-station property of oceanic trench/arc sys- 
tems to test archipelago override predictions made by tomography 
and plate reconstructions. The collision of North America with buo- 
yant arc crust should coincide with observed deformation and accre- 
tion events. 

Figure 3b shows inferred terrane locations before override: each 
active trench/arc system may include a subduction complex or exotic 
fragments. Using geological relationships in the present-day Cordillera’, 
we can match most hypothesized terranes with actual ones. ANG 
terranes (red) now make up the interior of Alaska, in fault contact with 
the Angayucham and related ophiolites. Those studying Alaska have 
long inferred a southwest-dipping subduction”. 

Green terranes west of Al represent the Franciscan subduction 
complex of present-day California. Two superterranes from earlier 
subduction had already loosely accreted to North America before 
archipelago override began: the IMS closest to the continent**, and 
the GUE to the south®, whereas the INS superterrane* probably 
provided the subduction nucleation for the MEZ arc. 

To provide for independent validation, our calibration points for 
sinking rates were chosen at tectonic events that are sharply defined in 
time and space (A1-A5 in Table 1). Events B1-B5, which are inter- 
leaved with the Al-A5 events, represent widespread Cordilleran oro- 
genic and accretion episodes, demonstrating explanatory power on a 
continental scale. Four stages of override are distinguished, as follows. 

Stage 1 (see Fig. 3a, b) is the beginning of the override of the east- 
verging MEZ promontory. Deformation was initially localized to the 
Pacific Northwest, as predicted by our model. Incipient deformation 
of the hinterland generated molasse that flooded the continental 
platform about 157 Myr ago between 45 and 55° N (ref. 30). A flip 
in subduction direction at about 165 Myr ago’ is recorded by the 
transition from proto-Franciscan formation (for example, the Red 
Ant formation”, shown as orange terranes southwest of Al in Fig. 3b) 
to Franciscan formation (green), marking an early subduction hand- 
over from MEZ to SF2. 

Stage 2 (Figs 3c, d) is the time of margin-wide orogenies as North 
America collided with an increasingly wide swath of MEZ/SF. This 
caused the Sevier and Canadian Rocky Mountains orogenesis since 
around 125 Myr ago. Inboard parts of IMS were partly constructed on 
top of stable North American crust in southern California*’ and had 
largely collapsed by 110 Myr ago, increasingly shedding zircons onto 
stable North America*‘, and vice versa; whereas IMS and the active 
Sierra Nevada arc shed zircons into the Franciscan trench** (SF2). 
Intrusion of Omenica magmatic belts successively eastward into 
northern IMS and adjacent displaced North American strata* (Bl, 
about 124-90 Myr ago) can be attributed to prolonged override of 
the MEZ promontory. 

Stage 3 (Fig. 3e, f) is when North America entered the Farallon 
hemisphere. As ANG collided obliquely, its terranes (red; now inter- 
ior Alaska) were accreted along the Canadian margin. Override of A3 
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Figure 3 | Sequence of trench overrides and terrane accretions. The left slice renders slab below 1,400 km depth; the 110-Myr-ago slice renders slab 
column (a, ¢, e, g) shows time-depth slices at t= 140, 110, 75 and 55 Myr ago; —_ below 1,100 km; the 75-Myr-ago slice below 750 km; and the 55-Myr-ago 
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Each slice shows only material that should have been deposited by that time, maps in the right column (b, d, f, h) are interpretative cartoons showing 
that is, slab at and below a depth of v X f, where v = 10 mm yr | istheassumed the evolution of inferred trench and terrane geometries. Yellow stars 
sinking rate. All slabs are coloured according to their current depths, but mark the tectonic events of Table 1. SR, Shatsky Rise; SRC, Shatsky 
mentally their upper truncations should be migrated up to the surface, Rise Conjugate. 

representing the inferred active arc locations at each time. The 140-Myr-ago 
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Table 1 | Sequence of archipelago override 


Event Geometric/kinematic event description Matched geological validation event Reconstructed time, Slab depth, Slab sinking velocity, 
t (Myr ago) d (km) v(mmyr~?) 

Al __ Start override of MEZ promontory. Initiation of Rocky Mountain deformation, recorded 146+ 24 1,500 + 100 1022 
Overridden segment is replaced by by synorogenic clastic wedge (160-155 Myr ago). 
incipient South Farallon trench SF2. Initiation of Franciscan subduction complex/South 

Farallon (165-155 Myr ago). 

Bl Gradual override of MEZ promontory Omenica magmatic belts in Pacific Northwest - - - 
by North America (Pacific Northwest). (124-90 Myr ago). 

A2 End override of MEZ promontory - 111+8 1,050 + 50 O21 
(shallowest point of slab wall). 

B2 Widening collision of North America margin Margin-wide strong deformation: Sevier and Canadian - - - 
with archipelago (MEZ/ANG/SF arcs). Rocky Mountains (since about 125 Myr ago). 

A3 Override of ANG arc, followed by slab Carmacks volcanic episode due to slab window 7447 850 + 50 12+1 
window. (72-69 Myr ago). 

A4 — South Farallon trench steps westward after Sonora volcanism due to slab window: Tarahumara 88+3 800 + 50 92] 
accretion of Shatsky Rise Conjugate plateau. ignimbrite province (85 + 5 Myr ago). 

B3 Strong transpressive coupling of Farallon Laramide orogeny, basement uplift more than - - - 
plate to superterranes as buoyant Shatsky Rise 1,000 km inland (85-55 Myr ago). 

B4 Conjugate subducts. Northward shuffle of INS, IMS, and ANG terranes along - - - 

margin (85-55 Myr ago). 
A5 Override of CR arc by Pacific Last terrane accretions: Siletzia, Pacific Rim 55+7 600 + 30 12+2 


Northwest USA 
B5 Final override of westernmost ANG. 


(55-50 Myr ago). 


(55-50 Myr ago). 


Explosive end of Coast Mountain arc volcanism - - - 


Column 2 describes tectonic events in terms of geometrically predicted slab-margin interactions. Column 3 describes matching events from the land geological record. For well-localized events (A1-A5), we 
estimate slab depth d, time t since last subduction (margin override), and slab sinking rate v, from the tomographic and plate models. Calculations are explained in the Methods; for uncertainty analysis see the 
Supplementary Information. Geologically observed timing does not enter the calculations, since the role of the geological events is to validate the geometrically inferred results. Events B1-B5 represent additional, 
first-order tectonic episodes explained by the scenario of archipelago override (these are not localized enough spatiotemporally to estimate d, t and v). 


was accompanied by a strong pulse of intermediate to basaltic volca- 
nism, the Carmacks formation at around 72-69 Myr ago”. Such high- 
temperature, mainly primitive volcanism arises from juxtaposition of 
hot, sub-slab asthenosphere after a slab, broken through arc collision 
(Fig. 2b), has started to sink’®. 

The South Farallon trench was migrating westward with North 
America, still building the Fransciscan formation. Around A4, slab 
geometry indicates an outboard step from SF2 to SF3. This coincides 
with a strong regional pulse of ignimbrite volcanism at 90 Myr ago 
(Tarahumara formation, up to 4km thick and 400km inland*””**) 
as the southern California/Sonora margin traverses the intermittent 
slab gap. 

The location and timing of this trench step-back coincide almost 
perfectly with the inferred arrival of a buoyant oceanic Farallon plat- 
eau at the North America margin, the conjugate half of the Shatsky 
Rise (Fig. 3 includes its reconstruction by ref. 39). Plateau collision is a 
proposed mechanism**° for choking subduction and causing the 
basement uplifts of the Laramide orogeny at around 85-55 Myr ago 
(B3). We suggest that the event explains another first-order obser- 
vation of Cordilleran geology (B4): the INS/IMS and ANG terrane 
packages, but not Franciscan and GUE, were rapidly shuffled north- 
ward along the margin between 85 and 55 Myr ago, by many hun- 
dreds to over 2,000km (ref. 41). The convergence vector of the 
Farallon plate did have a large northward component at the time’, 
but terrane transport additionally requires strong coupling to the 
Farallon plate and decoupling from North America. A buoyant 
Farallon plateau, compressed against INS/IMS and unable to subduct, 
could have achieved such coupling until the trench re-established 
itself further west. 

Stage 4 (Fig. 3g, h) is the end of archipelago override. North 
America overrides point A5 at 55 + 7 Myr ago, in excellent agreement 
with last observed terrane accretions in the Pacific Northwest (Siletzia, 
Metchosin and Pacific Rim terranes at about 55-50 Myr ago”). The 
trench stepped west (clear upward truncation at A5) as the terranes 
accreted, converting intra-oceanic CR into today’s continental Cas- 
cadia subduction. Also at about 55-50 Myr ago, ANG was terminally 
overridden (B5), accompanied by explosive volcanism as the Coast 
Mountain arc of British Columbia shut down. 

By then, slab complexity in the upper mantle rivalled today’s west- 
ern Pacific—not surprising given the numerous forced reorganiza- 
tions. The simple depth-age relationship suggested by Fig. 3 need not 
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apply to flat-lying Cenozoic slabs’* or to isolated deeper fragments like 
Kula or CR3, because all sinking estimates were calibrated on slab 
walls. We do not attempt to interpret upper-mantle structure and the 
Cenozoic land record here, but our archipelago model provides a new 
framework for doing so. 


Better constraints on surface and mantle 


Long-lived, stationary oceanic trenches explain two problematic 
observations, previously thought to be unconnected: the near-vertical 
geometries of the super-slabs under North America (without invok- 
ing exceptional mantle rheologies or ad hoc shifts in absolute ref- 
erence frame), and the long series of arc terrane accretions during 
Cretaceous times! (not explained by the South American analogy). 

An archipelago offshore Mesozoic North America had previously 
been suggested on the basis of land geology”®, but lacked the absolute 
spatial constraints provided by seismic tomography and our vertical 
sinking/terrane stations concept. Now-displaced terranes can be tied to 
their original, seismically imaged trench locations, but also to recon- 
structed continent locations, in an absolute reference frame. The hypo- 
thesis that continental collision with stationed terranes caused the 
various episodes of Cordilleran mountain building becomes testable. 

Before 150 Myr ago, North America was clearly located too far east 
for MEZ/ANG to pass for continental trenches, even when longit- 
udinal uncertainties of absolute reference frames* are factored in 
(Fig. 1 and Supplementary Fig. 3, and Supplementary Tables 1 and 
2). The apparently ‘wrong’ geometry and locations of the vertical slab 
walls—under the hypothesis of subduction of the Farallon plate 
beneath North America—had been recognized’**”**” with two kinds 
of solutions suggested. A longitudinal shift of the global lithospheric 
shell relative to the lower mantle”’, specifically a Cretaceous westward 
excursion that tapered down as the Atlantic opened, could have held 
North America stationary above MEZ/ANG. Alternatively, upper- 
mantle slab, spread out laterally by a west-coast trench, must some- 
how have aggregated into steep piles when transitioning into the 
lower mantle. This requires lateral displacements by over 1,000 km 
of huge volumes of slab*”“°. Some convection simulations have pro- 
duced such behaviour***’, whereas others suggest essentially vertical 
sinking”, as do newer observations'*’*"*. 

We showed that, within observational uncertainty limits, predicted 
and observed geological events are consistent. This validates simple 
vertical sinking and seems to explain all North American observations, 
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including accreted terranes, but is incompatible with the widely 
accepted continental Farallon trench since 175 Myr ago or before. 
The observed proportionality between slab depth and time since over- 
ride (consistent sinking rates across all three slab walls) is not required 
for our argument, but rather increases confidence in its correctness. 

Vertical slab sinking provides much tighter constraints on palaeo- 
geographic reconstructions than arbitrarily movable slabs. This sce- 
nario follows the principle of parsimony, so that future investigations 
should start with it, and seek observations requiring a departure from 
it. The equivalence of hotspot and “slab wall” absolute reference 
frames, which is implied by vertical sinking, is of great interest because 
slabs reach back farther in time than hotspot tracks (200 Myr ago or 
more’*”’ versus about 130 Myr ago*’”’), and they constrain absolute 
palaeo-longitude, which palaeomagnetic data alone cannot*’. However, 
to quantitatively realize a global subduction reference frame”, it will be 
necessary to re-examine whether trenches commonly assumed to have 
been continental were not actually intra-oceanic. 


METHODS SUMMARY 


We postulate that subduction into all slab walls imaged tomographically beneath 
North America’ originated before the arrival of North America’s western 
margin. Hence we must demonstrate sufficiently old slab ages, and cessation or 
flipping of subduction when the continent overrode the slabs. Plate reconstruc- 
tions predict the timing of margin arrival above a slab, but only if slab’* and plate 
reconstructions'**'**°°*? can be linked to the same absolute reference frame. 
Hence override predictions are correct to the extent that slab walls sank vertically, 
meaning that their x-y locations since subduction are unchanged in a linked 
hotspot reference frame. Uncertainty is best quantified from slab wall geometry 
itself: deviation from vertical sinking probably did not exceed a wall’s half-width 
(200-300 km), else such steep geometries could not have built up over a long time. 
Uncertainties about absolute locations of North America’s palaeo-margin arise 
from imperfections in plate reconstructions. Owing to terrane accretions and 
removals, there is also uncertainty about the shape and westward extent of North 
America’s margin, compared to its present-day outlines. We discuss individual 
uncertainties in the Supplementary Information, and propagate them into cumu- 
lative uncertainties for the times at which North America’s palaeo-margin over- 
rode selected points Al-A5 on the palaeo-trenches. With relative uncertainties 
of only 10-15% (Table 1 and Supplementary Tables 1 and 2), this yields the old 
(Jura—Cretaceous) slab ages required to support intra-oceanic subduction. 
Spatiotemporal predictions of trench override are verified by terrane observa- 
tions: override should coincide with observable collision events, because buoyant 
island arcs or plateaus are overridden. Uncertainties on terrane observations are 
difficult to quantify, but particularly characteristic events (AI-A5) can none- 
theless be singled out and used successfully for validation (Table 1). Clear upward 
truncations of all slab walls offer direct observational evidence for continental 
override of oceanic trenches, and are used to calculate slab-wall sinking rates. 


Full Methods and any associated references are available in the online version of 
the paper. 
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METHODS 


The MEZ/ANG slab walls have been among the most robust features in global- 
scale body-wave tomography, starting with the work of Grand’ **. The deep end 
of the CR slab was already visible in some of the earlier studies. Its continuous 
upward connection to present-day Cascadia subduction was pointed out by 
ref. 15, hence its identification as a Farallon slab. The model on which we base 
our discussion here’ is an inversion of P-wave observations recorded by North 
American broadband stations, using a cutting-edge waveform inversion tech- 
nique (multi-frequency tomography) on a global, adaptive grid. Method discus- 
sion and formal resolution tests are presented in ref. 13. The higher resolution 
compared to global tomography models is largely due to densely spaced stations 
from the USArray experiment in the western half of the US, and waveforms 
recorded 2005-2008, which were not included in any of the above global models. 
Our calibration points for sinking rate, especially A3-A5, lie within the mantle 
subvolume that considerably benefits from resolution improvements afforded 
by the new USArray data. 

We postulate that subduction into all slab walls imaged tomographically 
beneath North America’? originated before the arrival of North America’s west- 
ern margin. Hence we must demonstrate sufficiently old slab ages, and cessation 
or flipping of subduction when the continent overrode the slabs. Plate reconstruc- 
tions predict the timing of margin arrival above a slab, but only if slab and plate 
reconstructions'**'”*°°*? can be linked to the same absolute reference frame. 
(Technically we accomplish this with the free community palaeo-geographic 
information system GPlates*~* and a compilation of digitally published reference 
frames*’.) Hence override predictions are correct to the extent that slab walls sank 
vertically, meaning that their x-y locations since subduction are unchanged in a 
linked hotspot reference frame. Uncertainty is best quantified from slab wall 
geometry itself: deviation from vertical sinking probably did not exceed a wall’s 
half-width (200-300 km), else such steep geometries could not have built up over 
a long time. 

Uncertainties about absolute locations of North America’s palaeo-margin arise 
from imperfections in plate reconstructions. Owing to terrane accretions and 
removals, there is also uncertainty about the shape and westward extent of 
North America’s margin, compared to its present-day outlines. We discuss indi- 
vidual uncertainties in the Supplementary Information, and propagate them into 
cumulative uncertainties for the times at which North America’s palaeo-margin 
overrode selected points Al-A5 on the palaeo-trenches. With relative uncer- 
tainties of only 10-15% (Table 1 and Supplementary Tables 1 and 2), this yields 
the old (Jura—Cretaceous) slab ages required to support intra-oceanic subduction. 

Spatiotemporal predictions of trench override are verified by terrane observa- 
tions as follows. Override should coincide with observable collision events, 
because buoyant island arcs or plateaus are overridden. Uncertainties on terrane 
observations are difficult to quantify, but particularly characteristic events (Al- 
A5) can nonetheless be singled out and used successfully for validation (Table 1). 
Sinking-rate calculations provide an additional plausibility check. Consistent 
results of 9-12 mmyr | (+1-2mm yr‘) across all three slab walls show that 
this type of feature seems to sink rather predictably and evenly. If this is the case, 
then conversely the override prediction times that lead to the rate estimates 
should be adequate. 

Sinking at about 10 mmyr’’ is considered a lower-mantle sinking rate, but we 
obtain it as an average over both upper and lower mantle. This is explicable if a 
wall were to have sunk in steady state: upon subduction of its youngest end 
(Fig. 2b), its lower part would already have entered the viscous lower mantle 
and would have been setting the speed limit from below, which would have acted 
on the entire pile. Such a wall, widened to 4-6 times the lithospheric thickness 
(400-600 km), and sinking at about 10mm yr; generates the same material 
throughput as the typical 40-60mm yr | of unbuckled plate convergence in 
the uppermost mantle’®”°—a plausibility check that confirms the continuity of 
upper and lower mantle fluxes. 

Sinking rates are obtained by dividing the imaged depth of a wall’s shallowest 
end (depth reached since the end of subduction) by predicted time since the 
trench override. Besides timing uncertainty, a spatial uncertainty about the slab’s 
true depth enters, as discussed quantitatively in Supplementary Fig. 2. These 
uncertainties are comparatively small, because upward truncations of the slab 
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walls are sharply imaged. This is also a striking visual feature (see, for example, 
Fig. 3g or Supplementary Fig. 2): along their lengths, MEZ and ANG show abrupt 
upward truncations in red to orange, yellow and green depth levels. This provides 
direct observational evidence for westward subduction—after trench override, 
there was no slab left to subduct. In contrast, the less-complete upward truncation 
of eastward-dipping CR around A5 (cyan colour level, a more localized slab 
window) indicates ‘only’ a larger terrane accretion event and trench step-back 
when the margin transitioned from intra-oceanic to Andean-style. Upward trun- 
cations shallow to the west (A1 at red level, A2 at yellow level, A3/A4 at green 
level, and A5 at cyan level), reflecting more and more recent ages for termination 
of subduction. 

In summary, the observational uncertainties that we discuss in the Sup- 
plementary Information are: 

(1) Uncertainties in plate reconstructions at the surface. When exactly did the 
North American palaeo-margin overlie a given point in the reference frame, for 
example, Al-A5? This includes relative reconstruction uncertainties (essentially 
ambiguities about the Atlantic Ocean opening; they are small, and neglected 
in our calculations); uncertainties about absolute reference frame (these are 
considerable; we attempt to quantify them by comparing different reference 
frames'**!??5°°-*? (Supplementary Fig. 3); and uncertainties about the true west- 
ward extent of the palaeo-margin over time. The latter uncertainty is most 
difficult to quantify, since it requires knowledge about accreted terrane locations, 
which shifted over time. Our best guess of the uncertain area’s extent is hatched in 
Fig. 1a and b and Supplementary Fig. 3a and b. Margin uncertainty is the biggest 
contributor to reconstruction uncertainties (typically 6-7 Myr), except for the 
oldest point Al, where absolute reference frame uncertainty dominates (Sup- 
plementary Table 2). 

(2) Uncertainties in palaeo-trench locations, relative to the slab. Were trench 
points Al-A5 centred on the imaged slab walls, systematically offset to one side, 
or oscillating? We assume that the trenches ran centred, for lack of evidence to 
the contrary. A constant offset would hardly change the calculations, producing 
a strong correlation with margin uncertainty rather than independent uncer- 
tainties. Periodic trench advance and retreat—as buckling folds are being laid 
down—cannot be excluded, but we are unaware of observational evidence. 
Unless these oscillations significantly exceeded the half-width of the slab wall of 
200-300 km (unlikely, given the tall slab piles), they would not dominate over 
margin uncertainty. 

(3) Uncertainties about present-day slab depth (pertinent only to sinking-rate 
estimates). Evaluation of tomographic blur yields significantly smaller relative 
uncertainties than reconstruction errors (Supplementary Table 1). The two types 
of relative errors enter symmetrically into cumulative uncertainty on sinking rate. 

Regarding our hypothesis of intra-oceanic trenches, the most important, 
qualitative assessment of uncertainty is that all the plate reconstructions 
considered*!”***"°** (with one exception” discussed below) agree that two ocean 
basins should have existed between North America and the MEZ/ANG slabs 
before 140 Myr ago (Fig. la and Supplementary Fig. 3a). These oceans were 
considerably wider than the uncertainty in margin extent, implying intra-oceanic 
subduction origins for all imaged slab walls, provided they are older than 
140 Myr. That this is the case is shown by good agreement between predicted 
and geologically observed collision events (within the moderate error bars of 
Table 1). An extrapolation of sinking rates of about 10 mm yr’ to the lower ends 
of the slab walls at over 1,800 km depth implies that subduction originated at least 
180 Myr ago. 

The one reference frame’ that does not predict wide ANG/MEZ ocean basins 
in the late Jurassic/early Cretaceous was explicitly designed to minimize their 
extents, by imposing a constraint that keeps North America’s western margin 
stationary above the MEZ/ANG slabs (that is, an Andean-style margin is enforced 
a priori; SUB on Supplementary Fig. 3b). This is accomplished by introducing an 
additional degree of freedom, an ad hoc, otherwise non-observable, westward 
shift of the lithospheric shell relative to the lower mantle. In solving for this 
longitudinal shift, the method considers a global slab inventory, but for the 
times discussed here, the influence of the MEZ/ANG slabs dominates. Hence, 
this reference frame does not lend itself to evaluating the existence of the 
ANG/MEZ oceans. 
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The genomes of four tapeworm species 
reveal adaptations to parasitism 
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Tapeworms (Cestoda) cause neglected diseases that can be fatal and are difficult to treat, owing to inefficient drugs. Here 
we present an analysis of tapeworm genome sequences using the human-infective species Echinococcus multilocularis, 
E. granulosus, Taenia solium and the laboratory model Hymenolepis microstoma as examples. The 115- to 141-megabase 
genomes offer insights into the evolution of parasitism. Synteny is maintained with distantly related blood flukes but we 
find extreme losses of genes and pathways that are ubiquitous in other animals, including 34 homeobox families and 
several determinants of stem cell fate. Tapeworms have specialized detoxification pathways, metabolism that is finely 
tuned to rely on nutrients scavenged from their hosts, and species-specific expansions of non-canonical heat shock 
proteins and families of known antigens. We identify new potential drug targets, including some on which existing 
pharmaceuticals may act. The genomes provide a rich resource to underpin the development of urgently needed 


treatments and control. 


Echinococcosis (hydatid disease) and cysticercosis, caused by the pro- 
liferation of larval tapeworms in vital organs’, are among the most 
severe parasitic diseases in humans and account for 2 of the 17 neglected 
tropical diseases prioritized by the World Health Organization’. Larval 
tapeworms can persist asymptomatically in a human host for decades’, 
eventually causing a spectrum of debilitating pathologies and death’. 
When diagnosed, the disease is often at an advanced stage at which 
surgery is no longer an option*. Tapeworm infections are highly pre- 
valent worldwide’, and their human disease burden has been estimated 
at 1 million disability-adjusted life years, comparable with African 
trypanosomiasis, river blindness and dengue fever. Furthermore, cystic 
echinococcosis in livestock causes an annual loss of US$2 billion’. 

Tapeworms (Platyhelminthes, Cestoda) are passively transmitted 
between hosts and parasitize virtually every vertebrate species’. Their 
morphological adaptations to parasitism include the absence of a gut, 
a head and light-sensing organs, and they possess a unique surface 
(tegument) that is able to withstand host-stomach acid and bile but is 
still penetrable enough to absorb nutrients’. 


Tapeworms are the only one of three major groups of worms that 
parasitize humans, the others being flukes (Trematoda) and round 
worms (Nematoda), for which no genome sequence has been available 
so far. Here we present a high-quality reference tapeworm genome of 
a human-infective fox tapeworm Echinococcus multilocularis. We also 
present the genomes of three other species, for comparison; E. granu- 
losus (dog tapeworm), Taenia solium (pork tapeworm), both of which 
infect humans, and Hymenolepis microstoma (a rodent tapeworm and 
laboratory model for the human parasite Hymenolepis nana). We have 
mined the genomes to provide a starting point for developing urgently 
needed therapeutic measures against tapeworms and other parasitic 
flatworms. Access to the complete genomes of several tapeworms will 
accelerate the pace at which new tools and treatments to combat tape- 
worm infections can be discovered. 


The genomes and genes of tapeworms 


The E. multilocularis genome assembly was finished manually (Sup- 
plementary Information, section 2), producing a high-quality reference 
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genome in which 89% of the sequence is contained in 9 chromo- 
some scaffolds that have only 23 gaps (Supplementary Table 1.2). 
One chromosome is complete from telomere to telomere, and 13 of 
the expected 18 telomeres are joined to scaffolds (Fig. 1a). This quality 
and completeness is comparable to that of the first publications of 
Caenorhabditis elegans and Drosophila melanogaster genomes*”. 
The 115- to 141-megabase (Mb) nuclear tapeworm genomes were 
sequenced using several high-throughput technologies (Supplemen- 
tary Table 1.1). The tapeworm genomes are approximately one-third 
of the size of the genome of their distant flatworm relative, the blood 
fluke Schistosoma mansoni", mainly because it has fewer repeats 
(Supplementary Information, section 3). By sequencing several iso- 
lates of E. multilocularis (Supplementary Table 3.2), we revealed tetra- 
ploidy in protoscoleces of one isolate, and a trisomy of chromosome 9 
(the smallest chromosome, and possibly the only one for which a 
trisomy is tolerated) transiently exhibited in protoscoleces and meta- 
cestodes from two different isolates (Fig. 1c, dand Supplementary Figs 
3.1, 3.2 and 3.3), consistent with previous observations of karyotype 
plasticity in flatworms". 

Aided by deep transcriptome sequencing from multiple life-cycle 
stages, we identified 10,231 to 12,490 putative genes per genome (Sup- 
plementary Table 5.5). Similar to the genome of S. mansoni’, distinct 
‘micro-exon genes’ are present in tapeworm genomes, with multiple 
internal exons that are small (typically less than 36 bases) and divisible 
by 3 (Supplementary Information, section 5). To identify gene gain 
and loss in tapeworms, orthologous relationships were predicted 
between tapeworms and eight other species (Fig. 2). Although gene 
order has been lost, ancient chromosomal synteny is preserved among 
parasitic flatworms (Fig. 1b and Supplementary Table 7.3). Two chro- 
mosomes in E. multilocularis (Fig. 1a, b) correspond to the S. mansoni Z 
sex chromosome. Schistosomes are unusual among flatworms in that 
they have sexual dimorphism, but how common ancestors of both 
tapeworms and flukes evolved into female-heterogametic parasites, like 
S. mansoni, remains to be elucidated. 
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isolates are shown in Supplementary Fig. 3.1. 

d, Distribution of minor allele frequency (MAF) 
of heterozygous sites in five isolates of 

E. multilocularis (plot for individual isolates in 
Supplementary Fig. 3.1), identified by mapping 
sequencing reads against the assembled 
chromosome consensus sequences. At each site, 
the proportion of bases that disagree with the 
reference is counted. For four isolates, the MAF 
peaks at around 0.5, indicative of diploidy, whereas 
JAVA05/1 peaks at 0.25, suggesting tetraploidy. 
Chromosome 9 of GT 10/2 is plotted separately 
(marked by asterisk) from chromosomes 1 to 8, 
and the MAF display a clear departure of 0.5 and 
peaks around 0.33, consistent with a trisomy. 
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Genome-wide identification of polycistrons in tapeworms shows 
that there are 308 putative polycistrons in E. multilocularis, with the 
largest containing 4 genes. The internal gene order within E. multilocu- 
laris polycistrons is largely the same as in T. solium and H. microstoma 
(Supplementary Table 6.5), and—to some extent—as in flukes; 39% of 
S. mansoni orthologues of genes within E. multilocularis polycistrons 
retain colinearity. Of these S. mansoni genes, 40% have transcriptome 
evidence supporting their polycistronic transcription’®, demonstrating 
further that gene order in polycistrons is highly conserved over long evo- 
lutionary time’* (P < 0.0001, Supplementary Information, section 6). 

Polycistrons are resolved into individual coding transcripts using 
spliced-leader trans-splicing, but spliced-leader trans-splicing also 
occurs in genes outside of polycistrons. Using deep transcriptome 
sequencing (RNA-seq) we found evidence of spliced-leader trans- 
splicing in approximately 13% of E. multilocularis genes (Supplemen- 
tary Table 6.2), less than the 70% observed in C. elegans'* and 58% in 
a tunicate’’. 


Specialized metabolism and detoxification 

The high-confidence gene sets reveal extensive reductions in overall 
metabolic capability and an increased ability to absorb nutrients, com- 
pared to that of other animals (Figs 2 and 3, and Supplementary 
Information, section9). Their main energy source, carbohydrates, 
can be catabolized by aerobic respiration or by two complementary 
anaerobic pathways; the lactate fermentation and malate dismuta- 
tion pathways. The parasiticidal effects of mitochondrial fumarate 
reductase inhibitors have been demonstrated in vitro, suggesting that 
the malate dismutation pathway would be an effective target for the 
development of novel therapeutics’®. 

Tapeworms, like flukes, lack the ability to synthesize fatty acids and 
cholesterol de novo'”"’. Instead, they scavenge essential fats from the 
host using fatty acid transporters and lipid elongation enzymes (Sup- 
plementary Table 9.2), as well as several tapeworm-specific gene families, 
such as fatty acid binding protein (FABP) and the apolipoprotein 
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antigen B (Supplementary Information, section 8). Uptake of fatty 
acids seems to be crucial in Echinococcus spp. metacestodes, in which 
both FABP and antigen B gene families are among the most highly 
expressed genes’’ (Supplementary Table 5.7). Tapeworms and flukes 
have lost many genes associated with the peroxisome (Supplementary 
Information, section 8), an organelle in which fatty acid oxidation 
occurs, and may lack peroxisomes altogether, as seen in several other 
parasites””. 

Compared with other animals, S. mansoni has a reduced ability to 
synthesize amino acids’’. In tapeworms, this capacity is reduced 
further, with serine and proline biosynthesis enzymes absent from 
E. multilocularis (Fig. 3 and Supplementary Information, section 9). 
Many enzymes in the molybdopterin biosynthesis pathway seemed to 
be lost in tapeworms, along with enzymes that use molybdopterin as a 
cofactor. The ability to utilize molybdenum in enzymatic reactions 
was believed to be present in all animals’', but has been lost in some 
eukaryotic parasites”. 

Differences in the detoxification systems between tapeworms and 
their mammalian hosts may be exploited for drug design (Sup- 
plementary Information, section 9). We found that, like flukes”’, tape- 
worms typically have only one cytochrome P450 gene, suggesting that 
their ability to oxidize many xenobiotics and steroids is substantially 
lower than that of their hosts. Uniquely, tapeworms and flukes have 
merged two key enzymatic functions for redox homeostasis in one 
single enzyme: thioredoxin glutathione reductase (TGR). TGR is an 
essential gene and validated drug target in flukes**. Downstream of 
TGR we find an unexpected diversity of thioredoxins, glutaredoxins 
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Figure 2 | Evolution of tapeworm parasitism. 
Phylogeny of the main branches of Bilateria; 
Ecdysozoa (including fruitflies and nematodes), 
Deuterostomia (including lancelet, zebrafish, mice 
and humans), and Lophotrochozoans (including 
Platyhelminthes (flatworms)) (based on phylogeny 
in Supplementary Fig. 7.1). The gains and losses of 
life-cycle traits for these parasitic flatworms include 
the evolution of endoparasitism (a), passive 
transmission between hosts (b), acquisition of 
vertebrate intermediate host (c), ability to 
proliferate asexually in intermediate host (d). 
Morphological traits that have evolved include the 
loss of eye cups (e), gain of neodermatan syncytial 
epithelia (f), loss of gut (g), segmentation of body 
plan (h), and changes in the laminated layer (to 
contain specialized apomucins; i). Gains and losses 
of genomic traits include spliced-leader trans- 
splicing (1), loss of Wnt genes (2), loss of NEK 
kinases, fatty acid biosynthesis and ParaHox genes 
(3), anaerobic metabolic ability through the malate 
dismutation/rodhoquinone pathway, merger of 
glutaredoxin and thioredoxin reductase to 
thioredoxin glutathione reductase (TGR) (4), 
evolution of tapeworm- and fluke-specific 
Argonaute (Ago) family, micro exon genes (MEGs) 
and PROF1 GPCRs (5), loss of peroxisomal genes 
(6), and complete loss of vasa, tudor and piwi genes, 
NF-«B pathway, loss of 24 homeobox gene families 
(indicated by “H’), metabolic proteases and amino 
acid biosynthesis (7). In tapeworms, gains and 
losses of genomic traits include innovation of 
bimodal intron distribution and novel fatty acid 
transporters (8), expansion of mu-class glutathione 
S-transferases, GP50 antigens and tetraspanins (9), 
loss of the molybdopterin biosynthesis pathway, 
loss of 10 homeobox gene families (10), fewer 
GPCRs and fewer neuropeptides encoded by each 
protopeptide (11), and expansion of heat shock 
proteins (Hsp) and species-specific antigens (12). 


Intermediate host 
Definitive host 


& New genome 


and mu-class glutathione S-transferases (GSTs) (Supplementary 
Table 9.3). The GST expansion suggests that tapeworms would be able 
to water-solubilize and excrete a large range of hydrophobic com- 
pounds, which may add complexity to the pharmacokinetics of drugs. 


Homeobox gene loss 


Homeobox genes are high-level transcription factors that are impli- 
cated in the patterning of body plans in animals. Across parasitic 
flatworms, the homeobox gene numbers are extensively reduced 
(Supplementary Table 10.1). Most bilaterian invertebrates have a 
conserved set of approximately 100 homeobox genes (for example, 
92 conserved in C. elegans, 102 in D. melanogaster, and 133 in the 
lancelet)*’. Of the 96 homeobox gene families that are thought to have 
existed at the origin of the Bilateria, 24 are not present in tapeworms 
and flukes, and a further 10 were lost in tapeworms, making their 
complement by far the most reduced of any studied bilaterian animal”. 
Among the tapeworm-specific gene losses are gene families involved 
in neural development (mnx, pax3/7, gbx, hbn and rax). This is some- 
what surprising considering that tapeworms possess a well-developed 
nervous system, albeit with reduced sensory input and cephalization. 
Tapeworms also lack the ParaHox genes (gsx, pdx, cdx) ancestrally 
involved in specification of a through-gut’®”’, although these seem to 
have been lost before the tapeworm gut was lost. Other conserved genes 
found in bilaterian developmental pathways such as Hedgehog and 
Notch were found to be present and intact, although the Wnt com- 
plement is greatly reduced compared to the ancestral (spiralian) 
complement of 12 Wnt ligands** (Supplementary Table 10.2). 
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Em Total 
Super-pathway Pathway ECs 
Alanine, aspartate and glutamate metabolism 10 43 
Arginine and proline metabolism 14 103 
Cysteine and methionine metabolism 7 64 
Glycine, serine and threonine metabolism 6 58 
Histidine metabolism 4 37 
A . Lysine biosynthesis 1 31 
Amino ace Lysine degradation 8 54 
metabolism Phenylalanine metabolism 4 59 
Phenylalanine, tyrosine and tryptophan biosynthesis 1 32 
Tryptophan metabolism 10 68 
Tyrosine metabolism 4 65 
Valine, leucine and isoleucine biosynthesis 4 18 
Valine, leucine and isoleucine degradation 7 34 
Amino sugar and nucleotide sugar metabolism 20 96 
Citrate cycle (TCA cycle) 15 22 
Fructose and mannose metabolism BT) 65 
Galactose metabolism 10 37 
Carbohydrate Glycolysis/gluconeogenesis 22 45 
metabolism __!nositol phosphate metabolism 16 41 
Oxidative phosphorylation 8 12 
Pentose and glucuronate interconversions 5 60 
Pentose phosphatepathway 16 37 
Propanoate metabolism 9 47 
Pyruvate metabolism 16 64 
Starch and sucrose metabolism 10 71 
a-Linolenic acid metabolism 2 16 
Arachidonic acid metabolism 4 29 
Biosynthesis of unsaturated fatty acids 2 15 
Ether lipid metabolism 3 27 
Fatty acid biosynthesis 3 21 
Lipid Fatty acid metabolism £ 29 
metabolism Glycerolipid metabolism 8 36 
Glycerophospholipid metabolism 17 52 
Linoleic acid metabolism 2 ah) 
Primary bile acid biosynthesis 1 18 
Steroid biosynthesis 1 26 
Steroid hormone biosynthesis 2 38 
Folate biosynthesis vd 16 
Nicotinate and nicotinamide metabolism 6 47 
Metabolism of One carbon pool by folate 4 24 
cofactors and Pantothenate and CoA biosynthesis 7 31 
vitamins Riboflavin metabolism 2 21 
Thiamine metabolism 3 16 
Vitamin B6 metabolism 2 26 


Stem cell specializations 

Extreme regenerative capability and developmental plasticity, 
mediated by ever-present somatic stem cells (neoblasts), have made 
flatworms popular models for stem cell research”. All multicellular 
organisms rely on stem cells for proliferation and growth, so it is 
remarkable that tapeworms and flukes appear to lack the ubiquitous 
stem cell marker gene vasa (Supplementary Information, section 11). 
Instead tapeworms have two copies of another dead-box helicase 
(PL10), which we propose may have taken over some of the functions 
of vasa (Supplementary Fig. 11.1). Tapeworms and flukes are also 
missing the piwi gene subfamily and piwi-interacting tudor-domain 
containing proteins. The piwi genes belong to a subfamily of genes 
encoding argonaute proteins, and we also found that tapeworms have 
a new subfamily of argonaute proteins (Supplementary Fig. 11.2) that 
may bind a newly discovered potential small RNA precursor*®. Both 
piwi and vasa are usually essential in regulating the fate of germline 
stem cells in animals, and vasa suppression usually leads to infertility 
or death*’. These findings suggest that stem-cell-associated pathways 
in parasitic flatworms may be highly modified. 


Specialization of the tapeworm proteome 

We sought to identify novel and expanded gene families in tape- 
worms, and found many frequently occurring novel domains 
involved in cell-cell adhesion and the formation of the tegument 
(Supplementary Information, section 8). For example, several novel 
domains are found on the ectodomain of cadherins (Supplementary 
Information, section 8), and tapeworms have proportionally more 
tetraspanin copies (30-36) (Supplementary Table 12.1) than the 
highly expanded repertoires of fruitflies and zebrafish”. The acellular 
carbohydrate-rich laminated layer, which coats the outside of Echino- 
coccus metacestodes, is a unique genus-specific trait and one of the 
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_- Figure 3 | Conservation of individual metabolic 

100% pathways. Heatmap showing the conservation of 
individual metabolic pathways for E. multilocularis 
(Em), E. granulosus (Eg), T. solium (Ts), H. 
microstoma (Hm) and S. mansoni (Sm) compared 
to those of humans (Hs) and mice (Mm). Each row 
indicates an individual metabolic pathway grouped 
by their superclass membership (defined by KEGG 
(Kyoto Encyclopedia of Genes and Genomes)). 
Coloured tiles indicate the level of conservation 
(percentage of enzymes detected) of each pathway 
within each species. KEGG pathways with 
insufficient evidence (that is, containing only one 
enzyme) in E. multilocularis have been removed. 
CoA, coenzyme A; EC, enzyme commission 
number; TCA, tricarboxylic acid cycle. 


few morphological traits that differ between the very closely related 
species E. granulosus and E. multilocularis. We identified correspond- 
ing species differences in an Echinococcus-specific apomucin family 
(Supplementary Fig. 12.1), an important building block of the lami- 
nated layer*’. One particular copy is highly differentiated between the 
two species (non-synonymous to synonymous substitution ratio of 
>1) and is the fifth most highly expressed in the metacestode stage 
of E. multilocularis (Supplementary Table 5.7). Galactosyltransferases 
that probably decorate the apomucins with galactose residues, the 
predominant sugar of laminated layer glycans, are similarly diverged* 
(Supplementary Information, section 8). Approximately 20% of the 
genes are exclusive to tapeworms, and these include many highly 
expressed antigen families, such as antigen B, the glycosylphosphatidyl- 
inositol (GPI)-anchored protein GP50 (ref. 34), and the vaccine target 
EG95 (ref. 35) (Supplementary Table 12.4). 

One of the most striking gene family expansions in the tapeworm 
genomes is the heat shock protein 70 (Hsp70) family. Phylogenetic 
analysis revealed independent and parallel expansions in both the 
Hsp110 and the cytosolic Hsp70 clades (Fig. 4). Several examples of 
expansions exist at various clades of Hsp70 in other systems, includ- 
ing Hsp110 expansions in oysters (to cope with temperature) and in 
cancer cells (to cope with proteotoxic stress)***”. Echinococcus and 
T. solium have the highest number of gene expansions in the cytosolic 
Hsp70 clade. These expansions seem to have occurred independently 
in each species, and have resulted in 22 to 32 full copies in each species 
(Echinococcus and T. solium) compared to 6 copies in fruitflies and 
2in humans (Fig. 4). This expanded clade lacks classical cytosolic 
Hsp70 features (a conserved EEVD motif for substrate binding and 
a GGMP repeat unit), and whereas the canonical cytosolic hsp70 genes 
are constitutively expressed in different life-cycle stages, the non- 
canonical genes show almost no expression, suggesting a putative 
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Figure 4 | Heat shock protein 70 expansions in tapeworms. Rooted tree of 
Hsp70 sequences from tapeworms and the eight comparator species used in 
this study, with additional sequences from baker’s yeast Saccharomyces 
cerevisiae, and the Pacific oyster Crassostrea gigas (a non-flatworm example ofa 
lophotrochozoan with a recently reported Hsp70 expansion). Different Hsp70 
subfamilies are shown in different colours. Dotted red lines, E. multilocularis 
hsp70 genes that are located in the subtelomeres. EEVD, the conserved carboxy- 
terminal residues of a canonical cytosolic Hsp70; ER Hsp70, endoplasmic 
reticulum Hsp70. 


contingency role in which individual copies of the expanded family 
are only highly expressed under certain conditions (Supplementary 
Fig. 12.2). At least 40% of E. multilocularis hsp70-like genes are found 
within the subtelomeric regions of chromosomes, including the extreme 
case of chromosome 8 in which eight copies (including pseudogenes) 
are located in the subtelomere (Supplementary Table 12.2). No other 
genes are over-represented in these regions. Although Hsp70 proteins 
have been found in excretory-secretory products of tapeworms”, it 
remains to be determined whether the non-canonical Hsps have a 
host-interacting role or whether telomere proximity is important for 
their function or expression. 


Table 1 | Top 20 promising targets in E. multilocularis 
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Novel drug targets 


Tapeworm cysts are treated by chemotherapy or surgical intervention 
depending on tapeworm species, patient health and the site of the cyst. 
The only widely used drugs to treat tapeworm cysts are benzimidazoles”” 
that, owing to considerable side effects, are administered at parasiti- 
static rather than parasiticidal concentrations*. Novel targets and 
compound classes are therefore urgently needed. 

To identify new potential drug targets, we surveyed common targets 
of existing pharmaceuticals; kinases, proteases, G-protein-coupled 
receptors (GPCRs) and ion channels*’. We identified approximately 
250 to 300 new protein kinases (Supplementary Table 13.1), and these 
cover most major classes (Supplementary Information, section 13). We 
also identified 151 proteases and 63 peptidase-like proteins in E. multi- 
locularis, a repertoire of similar diversity to S. mansoni, and found that, 
like S. mansoni, E. multilocularis has strongly reduced copy numbers 
compared to those of other animals (Supplementary Table 13.9). Many 
successful anthelminthic drugs target one of several different forms of 
neural communication”. We therefore mapped the signalling path- 
ways of the serotonin and acetylcholine neurotransmitters, predicted 
conserved and novel neuropeptides (Supplementary Table 13.6), and 
classified more than 60 putative GPCRs (Supplementary Table 13.2) 
and 31 ligand-gated ion channels (Supplementary Table 13.4). A voltage- 
gated calcium channel subunit**—the proposed target of praziquantel— 
is not expressed in cysts and thus provides a putative explanation for 
the drug’s low efficacy. 

We searched databases for potential features for target selection, 
including compounds associated with protein targets and expression 
in the clinically relevant metacestode life-stage, and using this informa- 
tion we assigned weights to rank the entire proteomes (Supplementary 
Table 13.10). We identified 1,082 E. multilocularis proteins as potential 
targets, and of these, 150 to 200 with the highest scores have available 
chemical leads (known drug or approved compounds). 

Acetylcholinesterases, which are inhibited by mefloquine (an anti- 
malarial that reduces egg production in S. mansoni), are high on the 
list of potential targets**. However, acetylcholinesterase transcription 
in tapeworm cysts is low, possibly limiting their suitability. After 
filtering to remove targets with common substrates rather than inhi- 
bitors, the top of the list includes several homologues of targets for 


Target category Target Action Expression Drug Rank 
Current targets Tubulin B-chain Cytoskeleton MA Albendazole 406 
Voltage-dependent calcium channel on transport - Praziquantel 277 
Potential target Thioredoxin glutathione reductase (TGR) Detoxification MA Experimental compounds 277 
Top predicted targets Fatty acid amide hydrolase Bioactive lipid catabolism M Thiopental, propofol 1 
Adenine nucleotide translocator itochondrial ATP export M Clodronate 2 
nosine 5' monophosphate dehydrogenase Purine biosynthesis M Mycophenolic acid, ribavirin 3 
Succinate semialdehyde dehydrogenase GABA catabolism M Chlormerodrin 3 
Ribonucleoside diphosphate reductase Purine biosynthesis MA Motexafin gadolinium 5 
Casein kinase Il Cell-cycle regulating kinase MA Experimental compounds 6 
Hypoxanthine guanine Purine biosynthesis MA Azathioprine 8 
phosphoribosyltransferase 
Glycogen synthase kinase 3 ultiple signalling pathways MA Lithium 8 
Proteasome subunit Protein degradation M.A Bortezomib 16 
Calmodulin Transduces calcium signals MA Trifluoperazine 19 
FK506 binding protein Protein folding MA Pimecrolimus 19 
UMP-CMP kinase Phosphorylases M Gemcitabine 39 
ribonucleotides 
a‘/K* ATPase on transport M Artemether 42 
Carbonic anhydrase II Acidity control M Multiple (for example, 42 
Methazolamide) 
ADH dehydrogenase subunit 1 Energy metabolism M Multiple (for example, 42 
Methoxyflurane) 
Translocator protein Multiple functions MA Multiple (for example, 42 
Lorazepam) 
Elongation factor 2 Translation MA Experimental compounds 54 
Cathepsin B Protease M Experimental compounds 55 
Dual-specificity mitogen activated protein Signalling, activation of p38 M Experimental compounds 56 
Purine nucleoside phosphorylase Purine metabolism MA Didanosine 63 


A, adult; M, metacestode. Rank is sorted starting from the highest overall score; proteins with tied scores have the same rank. For current targets, the rank is only reported from the highest-scoring protein family 


member. For full scores and information please see Supplementary Table 13.10. 
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cancer chemotherapy, including casein kinase II, ribonucleoside reduc- 
tase, UMP-CMP kinase and proteasome subunits (Table 1). The chal- 
lenges of inhibiting cancer tumours and metacestodes (particularly 
those of E. multilocularis) with drugs are in some ways similar; both 
show uncontrolled proliferation, invasion and metastasis, and are 
difficult to kill without causing damage to the surrounding tissue. 
Therefore, metacestodes may be vulnerable to similar strategies as 
cancer; suppression of mitosis, induction of apoptosis and prevention 
of DNA replication. In fact, the anthelminthic medicines niclosamide, 
mebendazole and albendazole have already been shown to inhibit 
cancer growth™. 


Conclusion 


Tapeworms were among the first known parasites of humans, 
recorded by Hippocrates and Aristotle in ~300 Bc (ref. 45), but a safe 
and efficient cure to larval tapeworm infection in humans has yet to 
be found. These genomes provide hundreds of potential drug targets 
that can be tested using high-throughput drug screenings that 
were made possible by recent advances in axenic and cell culturing 
techniques****’, Flatworms display an unusually high degree of 
developmental plasticity. In this study, the high level of sequence 
completion enabled both gene losses and gains to be accurately deter- 
mined, and has shown how this plasticity has been put to use in the 
evolution of tapeworms. 


METHODS SUMMARY 


Genome sequencing was carried out using a combination of platforms. RNA 
sequencing was performed with Illumina RNA-seq protocols (for E. multilocu- 
laris, E. granulosus and H. microstoma) or capillary sequencing of full-length 
complementary DNA libraries (T. solium). The complete genome annotation is 
available at http://www.genedb.org. The tapeworm genome projects were regis- 
tered under the INSDC project IDs PRJEB122 (E. multilocularis), PRJEB121 
(E. granulosus), PRJEB124 (H. microstoma) and PRJNA16816 (T. solium). 
Sequence data for T. solium isolate (from Mexico) were used for all orthologue 
comparisons, but results relating to gene gains and losses were reconciled against 
an additional sequenced isolate from China (unpublished). All experiments 
involving jirds (laboratory host of E. multilocularis) were carried out in accord- 
ance with European and German regulations relating to the protection of animals. 
Ethical approval of the study was obtained from the ethics committee of the 
government of Lower Franconia (621-2531.01-2/05). Experiments with dogs 
(host of E. multilocularis sample RNA-seq ERS018054) were conducted accord- 
ing to the Swiss guidelines for animal experimentation and approved by the 
Cantonal Veterinary Office of Zurich prior to the start of the study, and were 
carried out with facility-born animals at the experimental units of the Vetsuisse 
Faculty in Zurich (permission numbers 40/2009 and 03/2010). A licensed hunter 
hunted the fox (host of E. multilocularis sample RNA-seq ERS018053) during 
the regular hunting season. Hymenolepis parasites were reared using laboratory 
mice in accordance with project license PPL 70/7150, granted to P.D.O. by the 
UK Home Office. 
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SCF’ ®*'” ubiquitin ligase targets 
cryptochromes at their cofactor pocket 


Weiman Xing’?, Luca Busino?, Thomas R. Hinds', Samuel T. Marionni*, Nabiha H. Saifee', Matthew F. Bush*, Michele Pagano*? 
& Ning Zheng! 


The cryptochrome (CRY) flavoproteins act as blue-light receptors in plants and insects, but perform light-independent 
functions at the core of the mammalian circadian clock. To drive clock oscillations, mammalian CRYs associate with the 
Period proteins (PERs) and together inhibit the transcription of their own genes. The SCF*®*'* ubiquitin ligase complex 
controls this negative feedback loop by promoting CRY ubiquitination and degradation. However, the molecular 
mechanisms of their interactions and the functional role of flavin adenine dinucleotide (FAD) binding in CRYs remain 
poorly understood. Here we report crystal structures of mammalian CRY2 in its apo, FAD-bound and FBXL3- 
SKP1-complexed forms. Distinct from other cryptochromes of known structures, mammalian CRY2 binds FAD 
dynamically with an open cofactor pocket. Notably, the F-box protein FBXL3 captures CRY2 by simultaneously 
occupying its FAD-binding pocket with a conserved carboxy-terminal tail and burying its PER-binding interface. 
This novel F-box-protein-substrate bipartite interaction is susceptible to disruption by both FAD and PERs, 
suggesting a new avenue for pharmacological targeting of the complex and a multifaceted regulatory mechanism of 


CRY ubiquitination. 


Cryptochromes (CRYs) are evolutionarily conserved FAD-binding 
proteins that share close sequence homology with DNA photolyases 
and are widespread in both plant and animal kingdoms'’. Using FAD 
as a chromophore, plant cryptochromes function as blue-light photo- 
receptors and regulate a broad range of light responses during plant 
growth and development'”. In animals, cryptochromes are classified 
into two types*°. Type I cryptochromes, exemplified by Drosophila 
melanogaster CRY, maintain blue-light sensitivity and have a key role 
in photic entrainment of the insect circadian clock®, whereas type II 
cryptochromes are photo-insensitive and evolved as central compo- 
nents of the molecular clock’®. 

Mammalian CRYs, the prototypic type II cryptochromes, act 
together with PERs, CLOCK and BMALI (brain and muscle aryl 
hydrocarbon receptor nuclear translocator-like protein 1) to consti- 
tute a transcription negative feedback loop that oscillates with a cir- 
cadian periodicity of approximately 24h (ref. 9). In this molecular 
clockwork, CLOCK and BMALI1 heterodimerize and activate the 
transcription of CRY and PER genes, whose protein products in turn 
form complexes in the cytoplasm and translocate into the nucleus 
to suppress their own gene expression by binding and inhibiting 
CLOCK-BMALI (refs 10, 11). In mammals, this basic clock circuitry 
drives the circadian oscillations of both the master clock in the brain 
and the peripheral clocks throughout the body”””. The master clock is 
synchronized to light-dark cycles perceived by the retina and coordi- 
nates clocks in the peripheral tissues through a variety of mecha- 
nisms, such as feeding behaviour, body temperature and hormonal 
secretion’*». 

Despite their functional divergence, many plant and animal crypto- 
chromes are functionally linked to protein ubiquitination and de- 
gradation. For example, light-activated Arabidopsis thaliana CRY1 
mediates seedling photomorphogenesis by inhibiting COP1 ubiquitin 
ligase’*’’, whereas Drosophila CRY modulates the insect circadian 


rhythm by promoting light-dependent degradation of the clock 
protein Timeless (TIM) and itself via the Jetlag ubiquitin ligase’*””. 
In both cases, photo-excitation of FAD is thought to induce confor- 
mational changes within the cryptochrome proteins and alter their 
interactions with the target proteins’. Recent biochemical and gene- 
tic studies have shown that mammalian CRYs are also functionally 
connected to the ubiquitin—-proteasome system”. The SCF’ ”*"® ubi- 
quitin ligase complex (SKP1-CUL1-F-box protein complex contain- 
ing the F-box protein FBXL3) has been identified to regulate the clock 
by promoting the degradation of CRYs. 

As an integral part of the clock circuitry, SCF'?™"?-catalysed CRY 
degradation needs to be regulated tightly. Indeed, the AMP-activated 
protein kinase (AMPK), a cellular metabolic sensor, has been shown 
to phosphorylate mouse CRY1 and accelerate its degradation by 
enhancing CRY1 binding to FBXL3 (ref. 25). Moreover, a recent 
circadian chemical screen has identified a period-lengthening small 
molecule that can compete with FAD for CRY binding and stabilize 
CRYs by inhibiting their ubiquitination”. These activities of the com- 
pound suggest that SCF'®*'’-mediated CRY ubiquitination may be 
tuneable by the FAD cofactor, whose function in mammalian CRYs 
remains elusive’. To provide the missing structural framework for 
understanding the functions of mammalian CRYs and the regulatory 
mechanisms of CRY-FBXL3 interaction, we report here the crystal 
structures of mammalian CRY2 in three different functional forms. 


Structure of murine apo CRY2 PHR 


Cryptochromes in general contain a conserved photolyase-homology 
region (PHR) and a unique cryptochrome C-terminal (CCT) exten- 
sion (CCE) (Supplementary Fig. la). We first determined the 27A 
crystal structure of the proteolytically stable mouse CRY2 PHR 
domain (amino acids 1-512) (Supplementary Fig. 2 and Supplementary 
Table 1). As expected, murine CRY PHR adopts a canonical photolyase 
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fold, consisting of an amino-terminal «/$ domain (N-terminal domain 
containing both «-helices and B-sheets), a C-terminal o-helical domain, 
and a connecting linker sequence (Fig. 1a). At the sequence level, verte- 
brate CRYs are more homologous to metazoan (6-4) photolyases than 
plant or type I animal cryptochromes (Supplementary Table 2). After 
superposition, murine CRY2 PHR can be aligned with the most homo- 
logous structure of Drosophila (6-4) photolyase with a root mean squared 
deviation (r.m.s.d.) of 1.1A over 431 equivalent Cox positions (Sup- 
plementary Fig. 3). However, unlike all known structures of (6-4) photo- 
lyases and cryptochromes”, the murine CRY2 PHR structure represents 
the apo form of the flavoprotein (Fig. 1a). We found no electron density of 
FAD at its expected cofactor-binding pocket, despite the well-defined and 
conserved FAD-binding cavity**. This feature is in agreement with pre- 
vious reports showing that only a small fraction of mammalian CRYs 
contains FAD when isolated from mammalian cells or various heterolog- 
ous systems’. Similar to Drosophila (6-4) photolyase, murine CRY2 PHR 
does not co-purify with a second cofactor that acts as a light-harvesting 
antenna in some bacterial photolyases””**". 


Structure of FAD-bound murine CRY2 PHR 


Oxidized FAD is naturally fluorescent with an emission maximum at 
approximately 525 nm. In the presence of excess murine CRY2 PHR, 
FAD fluorescence is strongly quenched, suggesting that murine CRY2 
PHR still retains specific FAD-binding activity (Fig. 1b). By titrating 
the amount of protein added, we estimated that FAD binds to isolated 
CRY2 PHR with an apparent dissociation constant (Kg) of approxi- 
mately 40 1M (Fig. 1b). We next crystallized murine CRY2 PHR 
together with FAD and obtained a 2.2 A resolution structure of the 
holoenzyme (Supplementary Table 1). As anticipated, FAD docks 
into the predicted cofactor pocket of murine CRY2 PHR and adopts 
the characteristic U-shaped conformation as seen in other crypto- 
chrome and photolyase structures (Fig. 1c). Superposition analysis 
shows that its binding mode is nearly identical to that revealed in 
the Drosophila (6-4) photolyase structure (Fig. 1d). 

A closer structural comparison, nevertheless, reveals a marked dif- 
ference at the cofactor-binding site between murine CRY2 PHR and 
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Figure 1 | Structure of murine CRY2 PHR in apo- and FAD-bound forms. 
a, Overall structure of the apo-bound form of the mouse CRY2 PHR (mCRY2 
PHR) domain. b, mCRY2 PHR induced quenching of FAD fluorescence. The 
inset shows FAD-fluorescence quenching with mCRY2 PHR concentration 
titrated. Error bars, s.d. c, A close-up view of FAD bound to mCRY2 PHR with 
positive F, - F, electron density contoured at 1.5 6 and calculated before the 
cofactor was built (green mesh). d, mCRY2 PHR and the Drosophila (6-4) 
photolyase (dm(6-4)-PL) are superimposed, with their FAD cofactors shown as 
sticks and their FAD carbon atoms in yellow and green, respectively. 
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other family members. In plant and insect (6-4) photolyases and cryp- 
tochromes, FAD is largely sequestered and buried deep inside the 
pocket as a prosthetic group””****. In marked contrast, the cofactor 
is only partially embedded in murine CRY2 PHR, with one side of its 
adenosine diphosphate group fully exposed to the solvent (Fig. 2a, b). 
This unusual exposure of the cofactor can be attributed to the uni- 
que features of two adjacent structural elements in murine CRY2 
PHR, the phosphate loop (a surface loop previously named for its 
phosphate-binding activity) and the protrusion motif. In Drosophila 
and Arabidopsis (6-4) photolyases, the adenosine moiety of FAD is 
mostly hidden under the well-ordered phosphate loop**”°. This loop 
harbours a central lysine residue that hydrogen bonds with the FAD 
adenine N7 atom and is secured further by a nearby sequence known 
as the protrusion motif (Fig. 2c). In murine CRY2 PHR, the protru- 
sion motif is shifted far away from FAD, whereas the phosphate loop 
becomes completely disordered in the crystal (Fig. 2c). These local 
structural variations in CRY2 result in an overall open conformation 
of the FAD-binding pocket, which might explain the moderate affi- 
nity of FAD. 


Overall structure of FBXL3-SKP1-CRY2 


The unique CCEs of vertebrate CRYs share a strictly conserved 11- 
amino-acid segment (conserved CCE sequence, CCS), which is fol- 
lowed by a highly variable C-terminal region (Supplementary Fig. 1b). 
Using a protein co-expression strategy, we noticed that the nearly full- 
length form of murine CRY2 (amino acids 1-544) with the intact CCS 
motif can form a stable complex with FBXL3-SKP1. This prompted 
us to crystallize and determine a 2.8 A resolution structure of the 
heterotrimeric complex (Supplementary Table 1). 

The overall architecture of the CRY2-FBXL3-SKP1 complex 
resembles an ice-cream cone with the globular CRY2 sitting on top 
of the cup-shaped FBXL3-SKP1 complex (Fig. 3). The FBXL3 protein 
harbours a canonical 3-helix F-box motif that interacts with the SCF 
adaptor protein SKP1, and its C-terminal leucine-rich-repeat (LRR) 
domain folds into a curved and sickle-shaped solenoid structure, 
whose concave surface wraps around the a-helical domain and the 
CCS region of CRY2 opposite to the «/B domain (Fig. 3 and Sup- 
plementary Fig. 4). The FBXL3-CRY2 complex buries more than 
4,800 A? solvent accessible surface area and spontaneously assembles 
in a phosphorylation-independent manner, which was confirmed by 
native protein mass spectrometry (Supplementary Fig. 5). 


The FBXL3 LRR domain 


The FBXL3 LRR domain contains 12 LRRs that pack in tandem to 
produce a semicircular arch (Fig. 4a). The solenoid topology of the 
LRRs dictates that the concave surface of the arch is formed by the 
parallel B-strands of the repeats and the convex side is decorated by 
the a-helices. We refer to the topside of the arch that is lined with 
intra-repeat loops as the apical ridge (Fig. 4b). 

Although the FBXL3 LRRs fold into a continuous single domain, it 
can be separated into two obvious halves as suggested by a promi- 
nent structural irregularity in LRR7 and LRR8. Compared to the six 
N-terminal LRRs, LRR7 has a much longer B-strand, which causes a 
significant offset at the apical ridge (Fig. 4b). This feature continues in 
LRR&, but is lost in LRR9 to LRR12. With a similar offset at the basal side 
between LRR8 and LRR9, all four C-terminal LRRs seem to be shifted 
up, relative to the six N-terminal repeats. Overall, LRR7 and LRR8 
mediate this spatial displacement and this hints at a possible gene fusion 
event when FBXL3 evolved in vertebrates. We name the two halves of 
FBXL3 LRRs as LRR-N (LRRI-LRR6) and LRR-C (LRR7-LRR12). 

A quick inspection of the FBXL3 LRR-CRY2 interface reveals that 
the two halves of the FBXL LRR domain have unequal roles in engaging 
the cryptochrome protein. All six repeats in LRR-C are in close contact 
with the o-helical domain of CRY2, whereas most parts of LRR-N, 
except its four apical loops, are separated from CRY2 by a gap (Fig. 4c). 
Consistent with a more important role of LRR-C in recruiting CRY2, 
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Figure 2 | The open FAD-binding pocket of murine CRY2 PHR. 

a, b, Surface representations of mCRY2 PHR and Drosophila (6-4) photolyase 
with their associated FAD cofactors shown as sticks. ¢, A close-up view of the 
open FAD-binding pocket of mCRY2-PHR shown with FAD as sticks and 
Drosophila (6-4)-photolyase superimposed (ribbons). Dotted blue line, 
disordered mCRY2 phosphate loop; dashed arrows, the differences of the 
phosphate loop and protrusion motif in the two proteins, respectively. 


the missense mutations in the alleles known as after hours and 
overtime, Cys358Ser and Ile364Thr substitutions, are mapped to 
LRR1O and LRR11, respectively (Fig. 4c and Supplementary Fig. 6). 
Because both residues are located in the hydrophobic core of the LRR 
fold, their substitution by polar amino acids probably perturbs the local 
conformation of LRR-C. 


The C-terminal tail of FBXL3 


The C-terminal tail of FBXL3 represents the hallmark of the FBXL3- 
CRY2 interface by sticking out from the flat concave surface of LRR-C 
and penetrating into CRY2 (Fig. 5a). The 12-amino-acid C-terminal 
region of FBXL3 is mostly invariant in vertebrates and terminates 
with a tryptophan residue (Supplementary Fig. 6). This sequence caps 
the LRR solenoid and then takes a sharp turn at Pro 422, inserting the 
5 amino acids closest to the C terminus into the o-helical domain of 
CRY2 (Supplementary Fig. 7). Notably, the last residue of the tail, 
Trp 428, reaches the core of the FAD-binding pocket and physically 
occupies the cofactor site. 

Superposition analysis shows that the FBXL3 C-terminal tail appro- 
aches the FAD-binding pocket from the same angle as the (6-4) 
DNA lesion approaches the FAD-binding pocket in the Drosophila 


Top view 


Figure 4 | Structure of the FBXL3 LRR domain. a, Ribbon diagrams of the 
LRR domain of FBXL3 with its complete C-terminal tail. Select LRRs are 
labelled and numbered at their helices and B-strands. b, An orthogonal view of 
the LRR domain shown in a. Double-ended arrows indicate the offset between 
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Figure 3 | Overall structure of the CRY2-FBXL3-SKP1 complex. a, Ribbon 
diagrams of the complex containing mCRY2 (blue), FBXL3 (magenta) and 
SKP1 (green). C, carboxyl terminus of FBXL3. The C-terminal helix and the 
CCS region in mCRY2 are shown in cyan and the F-box domain of FBXL3 is in 
grey. The inset shows a close-up view of the mCRY2 CCS region. b, Orthogonal 
view of the CRY2-FBXL3-SKP1 complex as shown in a. 


(6-4) photolyase-DNA complex structure*® (Fig. 5b). To expand the 
entrance, Met 425 of FBXL3 pushes aside an important CRY2 loop, 
which we name the interface loop. To enter the pocket, the FBXL3 tail 
stacks its Pro 426 residue against Trp 310 of CRY2 and flips the side 
chain of CRY2 His 373 with Thr 427 (Fig. 5c, d). In the FAD-bound 
form, these CRY2 residues directly contact the cofactor. Inside the 
pocket, the side chain of FBXL3 Trp 428 occupies the central space of 
the cavity, which corresponds to the narrow gap between the isoallox- 
azine and adenine rings of FAD. While donating a hydrogen bond to 
the backbone carbonyl group of Gln 307 in CRY2, the indole ring of 
Trp 428 makes multiple hydrophobic contacts with CRY2. The car- 
boxyl group of Trp 428 forms additional hydrogen bonds with CRY2 
Ser 414 and a signature arginine residue, Arg 376 (Fig. 5c, d). Overall, 
this network of interactions anchors the FBXL3 tail at the very centre of 
the FAD-binding pocket, predicting that FAD and the F-box protein 
will compete to bind the cryptochrome protein. 


Interface between FBXL3 and CRY2 PHR 


In addition to the C-terminal tail, the FBXL3 LRR domain also forms 
extensive interfaces with CRY2 through its solenoid fold. At the apical 
ridge of LRR-N, four FBXL3 loops from LRR1-LRR4 interact with the 
back of the o-helical domain of CRY2 opposite to the FAD-binding 
pocket (Figs 3b and 4c). This part of the interface is mediated largely 
by polar interactions (Supplementary Fig. 7). 

In contrast to LRR-N, the FBXL3 LRR-C subdomain binds CRY2 
through a predominantly hydrophobic interface. On its concave wall, 
LRR-C presents a cluster of more than ten hydrophobic residues strictly 
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Figure 5 | Interaction between the FBXL3 C-terminal tail and the CRY2 
FAD-binding pocket. a, Surface representation of the FBXL3 LRR-C 
subdomain with its C-terminal tail inserted into the FAD-binding pocket of 
mCRY2 (blue ribbons). b, The structure of Drosophila (6-4) photolyase-DNA 
complex shown from the same orientation as mCRY2 in a. FAD is shown as 
spheres. The damaged DNA substrate is shown as a stick diagram. c, d, A close- 
up view of the mCRY2 FAD-binding pocket showing key FAD- and FBXL3- 
interacting residues. FAD and the FBXL3 tail are shown as sticks. Dashed 
yellow lines represent hydrogen bonds. 


conserved in vertebrate CRYs (Fig. 6a and Supplementary Fig. 6). These 
residues, plus a circle of peripheral polar amino acids, recognize a large 
complementary surface on CRY2 adjacent to the cofactor pocket. The 
interface on the CRY2 side is constructed by three key structural motifs; 
the interface loop, the C-terminal helix and the CCS region (Fig. 6b). 
With a string of strictly conserved aromatic residues (Supplementary 
Fig. 8), the CRY2 interface loop is buttressed by Met 425 from the 
FBXL3 tail on one side and the CRY2 C-terminal helix on the other. 
The CRY2 CCS region adopts a well-ordered loop structure that wraps 
back onto the interface loop and the C-terminal helix. Together, these 
three motifs contribute a panel of hydrophobic residues grouped at the 
centre of the LRR-C binding interface. By making direct contacts with 
both FBXL3 and CRY2 PHR, the conserved CCS region of CRY2 pro- 
bably takes up its structural role at the interface through an ‘induced fit’ 
mechanism (Supplementary Fig. 9). 

To map regions that are critical for FBXL3-CRY2 interaction, we 
tested the binding activity of a series of FBXL3 and CRY2 mutants. 
Disruption of interactions at the apical ridge of LRR-N did not 
affect the complex formation (Asp181Ala and Tyr90Ala (Fig. 6c), 
Asp339Arg and Arg496Ala (Fig. 6d and Supplementary Fig. 7)), indi- 
cating a supportive role of LRR-N in recruiting CRY2. By contrast, 
CRY2 binding was impaired severely by either single amino acid 
substitution or truncation of the FBXL3 tail (Pro426Ala, Trp428Ala, 
Asp423A (Fig. 6c)). Similarly, alteration of a CRY2 residue in the 
cofactor pocket also compromised complex association (Trp310Ala 
(Fig. 5d, 6d)). These results pinpoint the FBXL3 C-terminal tail as a 
‘hot spot’ of the interface and suggest a possible antagonistic role of 
FAD in regulating the FBXL3-CRY2 interaction. In a purified system, 
we confirmed that free FAD, but not flavin mononucleotide (FMN), 
was sufficient to disrupt a pre-formed FBXL3-CRY2 complex in a 
dose-dependent manner (Fig. 6e). 

Our mutagenesis analysis also established the FBXL3 LRR-C sub- 
domain as a second critical docking site for CRY2. Mutations of 
individual aromatic residues at the hydrophobic concave wall of 
LRR-C or a nearby CCS-interacting residue effectively abolished 
CRY2 binding (Tyr292Ala, Tyr314Ala and Glu336Ala (Fig. 6a, c 
and Supplementary Fig. 9)). The same effect was seen when a single 
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Figure 6 | Structural and functional analyses of the FBXL3-murine CRY2 
interface. a, Surface and ribbon diagrams of the FBXL3 LRR-C subdomain 
showing hydrophobic and polar residues that are involved in mCRY2 binding. 
Amino acids selected for mutational analyses are labelled. The C-terminal tail 
and some N-terminal repeats are shown as ribbon diagrams. b, Ribbon 
diagrams of the mCRY2 structural elements involved in FBXL3 LRR-C 
interaction. Select interface residues are shown as sticks. A part of the FBXL3 
tail (magenta) is shown for reference. c, Interactions of retrovirus-expressed 
FBXL3 mutants with endogenous mCRY2 in mouse embryonic fibroblasts 
(MEF) assessed by co-immunoprecipitation (IP) and western blot analysis. 

d, Flag-mCRY2 complexes were immunoprecipitated with an anti-Flag resin 
from transfected HEK293T cells and assessed for binding to FBXL3, PERI, 
BMAL] and SKP1. Arg376Ala served as a negative control as it is probably 
detrimental to mCRY2 folding (see Supplementary Discussion for Cys430Ala 
and FBXL3-Cys340Ser mutants). e, Two in vitro competition assays showing 
the ability of FAD and PER2, but not FMN, to disrupt a preformed FBXL3- 
mCRY2 complex. f, MEFs were infected with retroviruses expressing either 
mCRY2 or mCRY2 mutants and treated with cycloheximide (CHX) for the 
indicated time in hours. Total extracts were examined by immunoblot analysis. 


hydrophobic-to-charged mutation was introduced to each of the 
three LRR-C-interacting structural motifs in CRY2; that is, the inter- 
face loop (Phe428Asp), the C-terminal helix (Ile499Asp) and the CCS 
region (Leu517Asp) (Fig. 6b, d). The functional importance of this 
part of the interface and the FBXL3 tail-gripping pocket was under- 
scored further by the stabilization of the CRY2 mutants with hot-spot 
residues altered at these sites (Fig. 6f and Supplementary Fig. 9). 
The C-terminal helix of CRY2 and its two basic residues, Arg 501 
and Lys 503, are thought to be essential for PER binding’. In com- 
plex with FBXL3, the entire CRY2 C-terminal helix is masked by the 
LRR domain of the F-box protein (Fig. 3b and Supplementary Fig. 9). 
We found by chance that mutation of Asp 339, which is in close prox- 
imity to Arg 501, also showed defective interaction with PER1 (Fig. 6d 
and Supplementary Fig. 7). These lines of evidence strongly suggest that 
PERs and FBXL3 share an overlapping CRY2-binding interface, which 
should prevent them from binding CRY2 simultaneously. Indeed, PER2- 
bound CRY2 is completely devoid of FBXL3 (Supplementary Fig. 10). 
Importantly, just like FAD, a purified C-terminal fragment of PER2, 
which corresponds to a previously mapped minimal CRY-binding 
region”, was able to dislodge CRY2 from FBXL3 (Fig. 6e). We were able 
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to confirm that the abundance of PER2 had a direct effect on the stability 
of both CRY1 and CRY2 (Supplementary Fig. 10)***°, probably by com- 
peting with the SCF'P*'? E3 ligase. 


Discussion 


The structure of the mammalian cryptochrome reveals an unexpected 
open FAD-binding pocket that probably evolved in vertebrates to 
enable dynamic FAD binding. This unique property is built on the 
plasticity of the phosphate loop guarding the entrance of the pocket. 
Intriguingly, this loop is highly conserved among vertebrates (Sup- 
plementary Fig. 8), raising the possibility that its alteration by 
phosphorylation””’* or CRY-binding proteins might regulate the 
affinity of FAD (Supplementary Fig. 11). 

Importantly, our structural results have unravelled the functional 
requirement for the dynamic FAD-binding pocket in CRYs and a 
novel mechanism of substrate-recognition by the SCF ubiquitin 
ligase. In contrast to the canonical degron-based substrate-engagement 
scheme*”**, FBXL3 takes advantage of the deep but accessible cofac- 
tor pocket of CRYs to capture the substrate (Supplementary Fig. 12). 
This interaction mode establishes FAD and its analogues as potential 
pharmacolo ogical agents for controlling clock oscillation by competing 
with SCF’ *™"*, Such an effect might be achieved by the CRY-stabilizing 
compounds identified in the recent circadian chemical screen”*. 

The unexpected interface between CRY2 and FBXL3 suggests that 
their interactions can be regulated by multiple mechanisms. As a 
common metabolic redox cofactor in the cell, it is possible that 
FAD directly controls the stability of CRYs by competing with the 
ubiquitin ligase. Although oxidized FAD binds murine CRY2 with a 
modest affinity (see Supplementary Discussion), its interaction with 
the CRY proteins can be influenced by its redox states and several 
factors mentioned above. The CRY2-FBXL3 structure also supports 
the idea that PERs may have a role in stabilizing CRYs by shielding 
them from SCF'®*'*. This function is reminiscent of the stabilization 
of PERs by CRYs against SCF?"°?-mediated degradation'**. Such 
interplay between the two obligate functional partners could serve as a 
mechanism for synchronizing their stability to ensure robust clock 
oscillation and phase shift. In fact, the CRY1-destabilizing effect of 
Ser 71 phosphorylation by AMPK may be partially explained by its 
reported activity in blocking PER2 binding”. 


METHODS SUMMARY 


The full Methods provides detailed information about all experimental proce- 
dures, including description of protein purification; protein crystallization, data 
collection and structure determination; FAD fluorescence assay; in vitro com- 
petition assay; and mutagenesis, cell culture, binding and stability analyses. 


Full Methods and any associated references are available in the online version of 
the paper. 
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METHODS 


Recombinant protein purification. The mouse CRY2 (amino acids 1-544) was 
expressed as a glutathione S-transferase (GST) fusion protein in High Five 
(Invitrogen) suspension insect cells and isolated by glutathione affinity chro- 
matography using buffer containing 50mM Tris-HCl, pH 7.5, 300 mM NaCl, 
10% Glycerol, 10 mM DTT (dithiothreitol) and 0.5 mM FAD. The protein was 
purified further by cation exchange and gel filtration chromatography after on- 
column cleavage by tobacco etch virus (TEV) protease. During purification, the 
polypeptide produced a series of C-terminal degradation products that were not 
affected by the presence or absence of FAD and dithionite. Two shorter versions 
of the protein, murine CRY2 with amino acids 1-527 and murine CRY2 with 
amino acids 1-512, were subsequently purified. Native mass spectrometry ana- 
lysis confirmed that murine CRY2 with amino acids 1-512 was stable, but murine 
CRY2 with amino acids 1-527 was not (Supplementary Figs 1 and 2). The mouse 
CRY2 (amino acids 1-544) protein was co-expressed with full-length human 
FBXL3 and SKP1 in High Five suspension insect cells (human FXBL3 is 97% 
identical to mouse FBXL3, and the F-box and LRR regions of human and mouse 
FBXL3 differ by only 3 amino acids; Supplementary Fig. 6). FBXL3 was fused with 
an N-terminal GST tag, whereas the other two proteins were tag-free. The CRY2- 
FBXL3-SKP1 complex was isolated from the soluble cell lysate by glutathione 
affinity chromatography. After on-column tag cleavage by TEV, the complex was 
purified further by anion exchange and gel filtration chromatography and then 
concentrated by ultrafiltration to 10 mg ml”! in a buffer of 20mM Tris-HCl, 
pH8.0, 200 mM NaCl, 5mM DTT. Mouse PER2 (amino acids 1095-1235) was 
overexpressed as a GST-fusion protein in Escherichia coli and purified by glu- 
tathione affinity, anion exchange and gel filtration chromatography. 
Crystallization, data collection and structure determination. The crystals of 
the CRY2-FBXL3-SKP1 complex were grown at 4°C by the hanging-drop 
vapour diffusion method, using 1.5 pl protein complex sample mixed with an 
equal volume of reservoir solution containing 0.2 M ammonium citrate, 13-14% 
PEG3350 and 7% acetonitrile. Diffraction-quality crystals that were obtained 
were subjected to a post-crystallization dehydration procedure by gradually 
increasing the concentration of PEG3350 to 30%, and then directly frozen in 
liquid nitrogen. This procedure consistently improved the resolution of the crys- 
tals beyond approximately 4 A. The CRY2-FBXL3-SKP1 derivative crystals were 
prepared by soaking the native crystals in the buffer containing 15% PEG3350 
and 7% acetonitrile supplemented with 0.2 mM KAu(CN)), for 12h, followed by 
soaking in 0.5 mM KAu(CN), for 12h and then 1 mM KAu(CN), for a further 
12h. After soaking, the crystals were dehydrated by the same method as native 
crystals. The crystals of murine CRY2 were also grown at 4 °C by the hanging- 
drop vapour diffusion method with 1.5 pl protein mixed with an equal volume of 
reservoir solution containing 0.1 M MES- imidazole, pH 6.5, 7.2% PEG8K, 20% 
ethylene glycol (v/v), 0.03 M NaF, NaBr and Nal. The crystals were directly frozen 
in liquid nitrogen. The CRY2-FAD crystals were obtained by soaking the murine 
CRY2 crystals with 1 mM FAD in the reservoir buffer with extra 7.5% Glycerol for 
2h before freezing in liquid nitrogen. All data sets were collected at the BL8.2.1 
and BL8.2.2 beamlines at the Advanced Light Source of the Lawrence Berkeley 
National Laboratory. Reflection data were indexed, integrated and scaled with the 
HKL2000 package”. The murine CRY2 PHR domain structure was determined 
by molecular replacement using the Drosophila (6-4) photolyase structure 
(PDB:3CVU) as the search model. Soaking with FAD altered the unit cell and 
substantially improved the murine CRY2 PHR domain crystals. For the apo struc- 
ture, translation libration screw-motion (TLS) refinement was necessary to reduce 
the R-free value below 0.30. The CRY2-FBXL3-SKP1 structure was solved by 
molecular replacement combined with single-wavelength anomalous dispersion 
phases (MR-SAD) using the software PHENIX”. The structural models were 
manually built, refined and rebuilt with the programs COOT*' and PHENIX”. 
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FAD fluorescence assay. FAD fluorescence and binding studies were carried out 
in a PBS buffer, pH 7.4, with 10% glycerol at room temperature in a half-area 96- 
well plate with a total volume of 100 jl. Interaction was quantified by measuring 
the decrease in FAD fluorescence (excitation at 450 nm and emission at 520 nm) 
after binding to mouse CRY2. FAD concentration was held constant at 10 1M in 
all wells, and murine CRY2 PHR was serially diluted ten times from 300 1M to 
292 nM. Nonspecific FAD binding (background) was measured in parallel with 
equal molar concentrations of ovalbumin or lysozyme, which do not bind FAD. 
All protein concentrations were run in duplicate. The binding data was fit using a 
standard nonlinear regression curve fitting method with a log(inhibitor) versus 
response model in Prism 5 (GraphPad Software). 

In vitro competition assay. Strep-tagged full-length FBXL3, GST-tagged murine 
CRY2 (amino acids 1-544) and His-tagged SKP 1 were co-expressed in monolayer 
High Five insect cells. The complex was purified without elution by Strep-Tactin 
affinity resin in a buffer containing 100 mM Tris-HCl, pH 8.0, 300 mM NaCl and 
20 mM DTT. The Strep-Tactin resin beads with the immobilized complex were 
subsequently aliquoted in a volume of 100 ul with 1 mg ml total protein and 
placed in a panel of 7.5mm gravity columns. The beads were incubated with 
300 il of oxidized FAD, FMN or purified murine PER2 (amino acids 1095-1235) 
in a serial dilution (FAD: 0, 5, 50, 500 uM and 5mM; FMN: 50 and 500 uM; 
murine PER2: 0, 2.5, 5, 10, 20 and 40 uM) for 3h at 4°C. Flow-through and 
subsequent two-column-volume wash fractions were collected from each column 
and the GST-CRY2 protein released from the column was examined with SDS- 
PAGE and detected by western blot analysis using anti-GST antibody. 

Cell culture. HEK293T cells were maintained in Dulbecco’s modified Eagle’s 
medium (DMEM) containing 10% bovine serum. Immortalized mouse embry- 
onic fibroblasts (MEFs) were maintained in DMEM containing 10% fetal bovine 
serum (FBS). HEK293T cells were transfected with PEI (Polysciences) according 
to the manufacturer’s instructions. Cyclohexemide (Sigma) was used at a final 
concentration of 100M. For retrovirus production, GP-293 packaging cells 
(Clontech) were used. Forty-eight hours after transfection, the virus-containing 
medium was collected and supplemented with 8 1gml~’ polybrene (Sigma). 
Cells were then infected by replacing the cell culture medium with the viral 
supernatant for six hours. Forty-eight hours after infection, cells were selected 
with 1 pg ml? puromycin. 

Co-immunoprecipitation binding studies. Cells were collected and subse- 
quently lysed in lysis buffer (50mM Tris-HCl, pH7.5, 150mM NaCl, 1mM 
EDTA, 50mM NaF, 0.5% NP40, plus protease and phosphatase inhibitors). 
Immunoprecipitation was carried out with anti-Flag agarose beads (Sigma). 
Immuno-complexes were washed five times with lysis buffer and eluted in 
Laemmli buffer containing 1% SDS. Cell extracts or Flag-immunocomplexes 
were analysed by SDS-PAGE and then by immunoblot analysis with the follow- 
ing antibodies: anti-Flag (Sigma), anti-CRY1 and CRY2 (Bethyl), anti-PERI 
(Alpha Diagnostic), anti-PER2 (Bethyl), anti-BMAL1 (Bethyl), anti-Skp1 
(Invitrogen) and anti-FBXL3 (ref. 22). 

Plasmids. Wild-type and mutant mouse CRY2 complementary DNAs were sub- 
cloned into pcDNA3.1-Flag. Mouse PER2 was cloned into pMT-Myc. For retro- 
virus production, cDNAs encoding Flag or HA-tagged FBXL3 or CRY2 wild-type 
and mutants, were subcloned into the retroviral vector pBabePuro (Cell Biolabs). 
Point mutations were generated using the QuikChange Site-directed Mutagenesis 
kit (Stratagene). 
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Femtosecond switching of magnetism via strongly 
correlated spin-charge quantum excitations 


Tianqi Li'*, Aaron Patz’, Leonidas Mouchliadis**, Jiagiang Yan}, Thomas A. Lograsso’, Ilias E. Perakis** & Jigang Wang"? 


The technological demand to push the gigahertz (10° hertz) switch- 
ing speed limit of today’s magnetic memory and logic devices into 
the terahertz (10'” hertz) regime underlies the entire field of spin- 
electronics and integrated multi-functional devices. This challenge 
is met by all-optical magnetic switching based on coherent spin 
manipulation’. By analogy to femtosecond chemistry and photo- 
synthetic dynamics’—in which photoproducts of chemical and 
biochemical reactions can be influenced by creating suitable super- 
positions of molecular states—femtosecond-laser-excited coher- 
ence between electronic states can switch magnetic order by 
‘suddenly breaking the delicate balance between competing phases 
of correlated materials: for example, manganites exhibiting colossal 
magneto-resistance suitable for applications**. Here we show 
femtosecond (107 '* seconds) photo-induced switching from anti- 
ferromagnetic to ferromagnetic ordering in Pro.7Cap.3MnOs3, by 
observing the establishment (within about 120 femtoseconds) of a 
huge temperature-dependent magnetization with photo-excitation 
threshold behaviour absent in the optical reflectivity. The develop- 
ment of ferromagnetic correlations during the femtosecond laser 
pulse reveals an initial quantum coherent regime of magnetism, 
distinguished from the picosecond (107 '” seconds) lattice-heating 
regime characterized by phase separation without threshold beha- 
viour’®. Our simulations reproduce the nonlinear femtosecond 
spin generation and underpin fast quantum spin-flip fluctuations 
correlated with coherent superpositions of electronic states to 
initiate local ferromagnetic correlations. These results merge two 
fields, femtosecond magnetism in metals and band insulators’’°, 
and non-equilibrium phase transitions of strongly correlated elec- 
trons’”"’’, in which local interactions exceeding the kinetic energy 
produce a complex balance of competing orders. 

There is growing evidence that femtosecond transient polarization 
of condensed matter systems during a laser pulse can be used to 
manipulate spin or change magnetic order’”*'* °°. Non-adiabatic 
coherent photo-excitation during very early times can be used to con- 
trol subsequent slower dynamics driven by the free energy or excited 
potential surface’’. In strongly correlated materials, such as colossal 
magneto-resistive (CMR) manganite and high-T, copper oxide sys- 
tems, prior ultrafast spectroscopy experiments mostly concentrated 
on laser-induced melting of electronic charge order and orbital order 
(CO/OO), and on structural phase transformations’*'’. Thus far 
experiments directly probing magnetism only showed picosecond 
and nanosecond spin generation, much slower than the femtosecond 
excitation pulse, without threshold dependence on laser intensity°°”””. 
At such ‘long timescales’, lattice heating and phase separation dyna- 
mics dominate, as seen, for example, in some CMR manganites*® (see 
also ref. 24, where it is shown that laser heating of materials with 
high temperature ferromagnetic orders can induce magnetization). 
Transient magnetism there was mostly indirectly inferred by trans- 
ferring static pictures and quasi-equilibrium theories to the ultrafast 
regime. 


Here we excite the antiferromagnetic (AFM) CO/OO ground state of 
CMR manganites shown in Fig. 1a, which consists of one-dimensional 
zig-zag chains with alternating manganese Mn**/Mn°* atoms (charge 
order) and ferromagnetically-aligned spins. This particular antiferro- 
magnetic structure is called CE-type AFM**. Neighbouring chains are 
AFM-ordered, which restricts inter-chain electron hopping because of 
the large magnetic energy penalty (Hund’s rule). The Mn** atom 
thgorbitals*“ are occupied by three electrons forming local S = 3/2 spins, 
S;. Inthe Mn** atoms, an additional electron of spin s; populates a Mn ey 
orbital’*. The resulting Jahn-Teller distortion of MnOg octahedra lifts 
the e, orbital degeneracy on the Mn?" atoms, which are then populated 
by alternating d3,2__,2 j d3y2_,2 orbitals (orbital order)**. The measured 
phase diagram of our Pro 7Cap 3MnO ; manganite, Fig. 1b, results from 
the competition between these AFM CO/OO insulating and ferromag- 
netic metallic phases. The ground state is however always insulating 
under weak magnetic fields (B < 0.5 T in our experiment), for all rele- 
vant temperatures. It exhibits CO/OO below ~200K and CE-type 
AFM below the Néel temperature, Ty ~ 140K (shaded area). Thus, 
heating of our system cannot induce ferromagnetism, unlike those 
systems with high-temperature ferromagnetic orders”. 

This Letter reports a femtosecond AFM to ferromagnetic phase 
transition in Pro. 7Cap.3MnO3. In particular, we observe a threshold 
excitation fluence for inducing a huge magnetization during coherent 
nonlinear optical excitation of the CE-type AFM CO/OO ground state 
by a 82-fs laser pulse. To see this, we performed two-colour—near- 
infrared pump (1.55eV), ultraviolet probe (3.1 eV)—femtosecond- 
resolved magnetic circular dichroism (MCD) and magneto-optical 
Kerr rotation (MOKE) measurements. This geometry allows us to 
observe, at 3.1 eV, genuine femtosecond spin dynamics of the tog elec- 
tron magnetization component S, perpendicular to the sample surface, 
induced by the pump (see Methods). We measure femtosecond 
changes in the complex magneto-optical angle Ox = 0; +in,, where 
the imaginary part 1, (MCD) and real part 0, (MOKE) correspond to 
absorption difference and phase shift, respectively, between right- and 
left-circularly polarized light, induced by S,(t). 

Typical temporal profiles of the pump-induced ellipticity change, 
Anj, during the first 5 ps are shown in Fig. 1c, at 30 K and for two pump 
fluences. An, >0 indicates photo-induced magnetization from the 
AEM ground state. At low pump fluence, 0.64mJ cm” (red filled 
circles), we only observe a gradual, picosecond magnetization rise. 
An increase of pump fluence to 5.8 mJ cm’ * (open circles) reveals a 
remarkable femtosecond regime of spin photogeneration: a huge 
quasi-instantaneous jump in magnetization, three orders of mag- 
nitude larger than in magnetic semiconductors””’, now precedes pico- 
second spin dynamics. Deconvolution of the An, and probe pulse 
temporal profiles give a fast magnetization rise time of ~120 fs 
(Fig. 1c, inset). The clear discontinuity of An; (dashed line, Fig. 1c) 
marks two different temporal regimes of transient magnetism. 
Figure 2a shows the detailed pump-fluence dependence of An: 
it reveals a photo-excitation threshold, I, ~ 2.5 mJ cm’, for the 


Department of Physics and Astronomy, lowa State University, Ames, lowa 50011, USA. @Ames Laboratory - USDOE, Ames, lowa 50011, USA. *Department of Physics, University of Crete, Heraklion, Crete 
71003, Greece. “Institute of Electronic Structure and Laser, Foundation for Research and Technology-Hellas, Heraklion, Crete 71110, Greece. +Present addresses: Department of Materials Science and 
Engineering, University of Tennessee, Knoxville, Tennessee 37996, USA; Materials Science and Technology Division, Oak Ridge National Laboratory, Oak Ridge, Tennessee 37831, USA. 


4 APRIL 2013 | VOL 496 | NATURE | 69 


©2013 Macmillan Publishers Limited. All rights reserved 


LETTER 


Classical spin hopping 


lyW> 


Mn+ lt*> 
, 


1.55-eV 
pump 


Quantum spin hopping 


3.1-eV 


robe For example, one eigenstate is 


| tt> +1 tut> 


Figure 1 | A quantum many-body scheme for femtosecond switching of 

magnetism and ultrafast photo-induced spin dynamics in Pro 7Cap 3MnO3. 
a, Schematics of ultrafast excitations of the CE-type AFM/CO/OO order, whose 
spin, charge, and orbital lattice pattern, shown in the left panel, consists of one- 
dimensional AFM-coupled zig-zag chains. For classical spins (right top panel), 
electron conduction and optical transitions are restricted within the same one- 
dimensional ferromagnetic chain because J = S — 1/2 total spin configurations 
are suppressed by Hund’s rule (white arrow, classical spin hopping). Quantum 
spin-flip fluctuations (right bottom), however, allow e, electrons to hop on sites 
with opposite local th spin, by forming non-equilibrium total spin eigenstates 
(see equation (1)) as illustrated (red arrow, quantum spin hopping). b, The 

equilibrium phase diagram of our Pro. 7Cag 3MnO; sample as a function of 

temperature and magnetic field (sweeping up). The red dots show the measured 


emergence of femtosecond magnetization. In contrast, the charge 
dynamics, measured by the pump-induced reflectivity change AR/R 
(Fig. 2b), exhibits no threshold, which rules out melting of the CO/ 
OOas the main origin of I,,. The critical fluence dependence exclusively 
for the femtosecond spin dynamics represents a hallmark of non- 
equilibrium magnetic phase transitions. 

Further evidence for femtosecond change of magnetic ordering is 
presented in Fig. 3a, which compares the MCD An, and MOKE A@, at 
two pump fluences and demonstrates notable differences between fem- 
tosecond and picosecond time intervals. Both MOKE and MCD dem- 
onstrate the existence of a fluence threshold for observing a distinct 
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boundary between the antiferromagnetic and ferromagnetic phases. Note that, 
for sufficiently small magnetic fields, the antiferromagnetic insulating phase 
persists at all temperatures. c, Ultrafast photo-induced ellipticity change An, for 
magnetic field B = 0.5 T and temperature T = 30 K for low (0.64 mJ cm 7”, red 
dots) and high (5.8 mJcm’”, open circles) pump photo-excitation intensity. 
The femtosecond magnetism (temporal regime (1)) is clearly distinguished 
from the picosecond magnetism (temporal regime (2)) and only appears for 
high intensity. Inset, the instrument response time (shaded) and the 
convolution of the probe pulse with a double-exponential time-dependent 
magnetization increase. We extract time constants t, = 120 fs (fast 
femtosecond magnetization) and tz = 18,500 fs (slow picosecond 
magnetization). Open circles, experimental results (Exp); solid curve, double- 
exponential fit (Fit). 


femtosecond pump-induced magnetization, with the same I,, in both 
(Supplementary Information). Such coincidence corroborates our con- 
clusions. In contrast, the picosecond components of An, and Aé, 
exhibit different temporal profiles (for example, a damped oscillation 
is absent in MOKE) without threshold, indicative of slow thermal non- 
magnetic contributions®®. In Fig. 3b, the femtosecond magnetization 
vanishes when the magnetic field is reduced from 0.5 T to OT, again 
pointing to the magnetic origin of the “discontinuity in Ay, and AQ; 
and, thereby, the fast spin generation during the excitation pulse. 
Furthermore, the observed femtosecond spin photogeneration 
depends sensitively on temperature and AFM order (Fig. 3c). At 


Figure 2 | A three-dimensional view demonstrating the distinct 

femtosecond spin and charge dynamics as well as photo-excitation threshold 
behaviour. a, Time-resolved ellipticity change An, and b, differential optical 
reflectivity AR/R as function of pump fluence. Measurements are taken under 
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the same experimental conditions. A photo-excitation threshold 

Ty, 2.5 mJ cm ” is seen for emergence of femtosecond magnetization (An;,), 
whereas the charge dynamics exhibits no threshold. Magnetic field B = 0.5 T 
and temperature T' = 30K. 
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Figure 3 | Evidence for genuine femtosecond switching of magnetic 
ordering and its sensitive temperature dependence. a, Comparison between 
ellipticity An, (solid lines) and MOKE A6, (dashed lines) under pump 
excitation fluences of 4.48 mJ cm? (black) and 1.60 mJ cm ° (red). 

b, Magnetic field dependence of the ultrafast photo-induced ellipticity change 
during the first 5 ps (split axis) and for extended time intervals up to 60 ps 


200 K and 150K, that is, above the AFM phase transition, we observe 
negligible femtosecond magnetization components for all fluences 
used. The femtosecond magnetization appears at 50 K above thresh- 
old, reaches its maximum around 30K, and then decreases again at 
lower temperatures of 20K and 10K. We attribute this behaviour to 
the competition between spin stiffness—describing the rigidity of the 
AFM order—and thermal fluctuations, which display opposite tem- 
perature dependence. We note that the measured temperature for 
maximum femtosecond spin generation, 30K, corresponds to the 
minimum critical B field required for driving an AFM to ferromagnetic 
phase transition in Fig. 1b. 

Figure 4a and b compares the pump-fluence dependence of An, and 
AR/R at two fixed times for the data shown in Fig. 2a and b, At = 200 fs 
and 6 ps: we make two observations. First, any threshold dependence 
of photo-induced magnetization clearly smears out in the picosecond 
An, data (black open circles), which shows a linear power dependence, 
and is absent in both picosecond (red open circles) and femtosecond 
(filled red squares) AR/R. Thresholdless behaviours can be attributed 
to phonon-assisted phase separation®®. However, the distinct non- 
linear pump fluence and threshold dependence of the femtosecond 
spin dynamics (Fig. 4a) reveal a new femtosecond magnetism non- 
thermal temporal regime during coherent nonlinear photo-excitation, 
well-distinguished from the picosecond thermal temporal regime. 
Second, the femtosecond build-up of ferromagnetic correlations above 
threshold Jy, coincides with a transition, at Ky, from linear to non- 
linear intensity dependence of AR/R (shaded area). Such coincidence 
suggests that global electronic conducting paths emerging from photo- 
induced interchain hopping are important for femtosecond (but not 
picosecond) magnetism. 

Can we attribute our observation of femtosecond magnetism to the 
photodoping of the AFM ground state with electrons and holes? A 
transition from AFM to ferromagnetic states with electron doping 
was predicted for Hamiltonians without CO/OO”, but these states are 
unstable to phase separation: ferromagnetic polarons, that is, an electron 


(logarithmic scale). Black, B = 0.5 T; red, B = 0 T. T = 30K. c, Temperature 
dependence of the photo-induced ellipticity for the first 4 ps under two pump 
excitation fluences of 0.64 mJ cm” (red) and 5.80 mJ cm”? (black). Note the 
non-monotonic dependence of the femtosecond photo-induced magnetization 
on temperature: it peaks at around 30K. 


surrounded by a ferromagnetically aligned neighbouring spin ‘cluster’, 
form locally inside the AFM matrix”®. Above critical doping, polaron 
percolation results in a global ferromagnetic ground state. At equilib- 
rium, electron doping of the CE/CO/OO state (Fig. la, x = 2 + Ax) 
gives an unconventional ferromagnetic metal (x < 1/2), whereas the 
system remains CE-AFM insulating with hole doping (x > 1/2). Such 
electron-hole asymmetry, important for photodoping effects, arises 
from competition between electron delocalization (by canting anti- 
parallel tj, spins) and self-trapping (by Jahn-Teller distortions)”’. 
However, the latter lattice processes are slow (picosecond)”. During 
femtosecond times shorter than the Jahn-Teller phonon period and 
thermalization times, magnetization can only be induced by fast spin- 
flip/canting non-equilibrium processes. 

Below, we propose a non-equilibrium spin-exchange mechanism 
that produces ferromagnetic correlations during the femtosecond laser 
pulse. Classical spin scenarios*” neglect spin-flips, and so electron 
hopping from atom to atom diminishes with increasing angle between 
the local spins’. Consequently, interchain e,-electron hopping is sup- 
pressed by AFM spin alignment, resulting in electronic confinement 
within one-dimensional chains (Fig. 1a, white arrows). This, however, 
suppresses any femtosecond spin dynamics, as the e, and ty, spins 
remain parallel within the same chain”. We must thus turn to 
quantum spin scenarios and, additionally, engage the transient coher- 
ence, correlation and nonlinearity that dominate during the laser 
coherent excitation, a new paradigm in ultrafast magnetism. Quan- 
tum spin-flips (Fig. 1a, red arrows) allow for electron hopping to a site 
with antiparallel local spin, as we can now populate the degenerate 
non-equilibrium quantum states of photo-excited and local electron 
spins shown schematically in Fig. la, which diagonalize the strong 
Hund’s rule interaction: 
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Figure 4 | Interchain femtosecond quantum spin-flip fluctuations establish 
ferromagnetic correlations during the coherent laser photo-excitation. 

a, Pump fluence dependence of photo-induced Anx and b, AR/R, after two time 
intervals, At = 200 fs (filled squares) and At = 6 ps (open circles), following 
photo-excitation, at 30 K. These summarize the data presented in Fig. 2a and b. 
The nonlinear fluence dependence with threshold I, marked by the vertical 
dashed blue line is only seen for the femtosecond Ax. Note the different 
behaviour between the red shaded area (high intensity revealing femtosecond 
magnetism) and the low intensity regime (only picosecond magnetism). 
Although the photo-induced optical reflectivity change does not display any 
threshold behaviour, the high intensity red shaded regime above Iyy, 
demonstrates a saturation nonlinear behaviour, while for low intensities below 
Ix, we observe the linear dependence on photo-excitation intensity expected 
from the usual perturbation expansion in terms of the optical field. c, Calculated 
photo-induced total spin for three Rabi energy values similar to those in the 
experiment, d= 70 meV (black), 0.3d (blue), 0.1d(red), and inter-atomic 
coherence (inset) for d = 70 meV. The laser pulse time-dependence is 
superimposed (grey; E’(t)), demonstrating that ferromagnetic local 
correlations transiently build-up from the AFM ground state (total spin zero) 
during the photo-excitation, with nonlinear dependence on Rabi energy and 
hence intensity. The time-dependence of the interatomic coherence describes 
‘photo-induced bond’ order, which implies strong transient coupling between 
neighbouring atoms during the lifetime of the non-thermal populations, that is, 
before the establishment of a quasi-equilibrium hot temperature. The damped 
oscillations in the photo-induced spin, with period determined by the Jahn- 
Teller energy, reflect deviations from adiabaticity. 


where |iSS.) are the Mn*" fp,-spin eigenstates. The eigenvalues of the 
total spin J; = s; + S;areJ =S + 1/2andM=-—J,---,J. cd adds a spin- 
o electron in orbital ~ at site i. The quantum-coherent superposition 
given in equation (1), suppressed in the classical spin limit S— ~, allows 
for ultrafast quantum spin dynamics, driven by the off-diagonal Hund’s 
rule magnetic interaction J};S7"-s;* . 

In the insulating AFM state, we expect localized photo-excitations”*. 
We thus describe the spin, charge and orbital populations at the ith 
atom by the diagonal density matrix elements p,(m) = (|iSm){iSm|) 
(Mn**) and pM) = (|ixM)ixM|) (Mn**). The 3.1-eV magneto- 
optical signal probes the hg spin’® S,(i)= S> mS cos Om p;(m) + 
Yo am S cos Ou p;(aM), where cos 0,,=m/S and cos 04, = MI]. 
The 1.55-eV femtosecond pump field, tuned across the Jahn-Teller 
insulator gap, drives coherent electron hopping between neighbouring 
atoms i and j with different Jahn-Teller distortions. Such quantum 
kinetics is hidden in static measurements”, and the equations of 
motion couple p(m) and pM) to the off-diagonal density matrix 
(a UM" exe ( iM) ) describing time-dependent electron-hole super- 
positions of electronic quantum states given by equation (1) in two 
different atoms i and j (bond orders). The composite fermions created 
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by él, (iM) = |inM) (18M -$| treat the strong correlation (see 


Supplementary Information)”. 

The solution of our density matrix equations for a two-site inter- 
chain cluster with ground state M = $+ 1/2 (Jahn-Teller-distorted 
site i) and m=-—S (empty site j) numerically demonstrates the 
development of ferromagnetic correlations with femtosecond time- 
dependence and nonlinear dependence on photo-excitation similar 
to Fig. 1c, the key features of the femtosecond quantum spin regime. 
The calculated photo-induced spin AS_(ij) = AS,(i)+ AS-(j) is shown 
in Fig. 4c for three Rabi energies, together with the laser pulse (here the 
Rabi energy is d = eEa, where E is the optical field, e is the electron 
charge, and a is the lattice spacing). The decrease in Rabi energy, from 
d=70mevV (black) to 0.1d (red trace), diminishes the femtosecond 
spin dynamics as the perturbative regime is recovered for small d. 
Ferromagnetic correlation, AS,(ij) > 0, develops during laser excita- 
tion, driven by nonlinear photo-excited interatomic coherences 
A et, (jM’ éus(iM) ) (see inset, Fig. 4c for d= 70 meV) and elec- 
tron-hole asymmetry. This spin correlation is enhanced by fast virtual 
electron hopping due to non-thermal/coherent nonlinear population 
changes and photo-induced bonding. The hole, created at Mn°* site i 
populated by parallel e, and t, spins, leaves the t,, spin unchanged. 
The electron is created at the neighbouring Mn*” site j populated by tog 
spin S, = S (intrachain excitation) or S, = —S (interchain excitation). 
It populates a Mn** quantum state of the form given by equation (1) 
with M=S,+ 0/2 after interacting with the local spin. Interchain 
electron-hole excitations increase the S, = —S spin via spin-flip (equa- 
tion (1)), thus inducing ferromagnetic correlations between AFM- 
aligned chains. Intrachain excitations move the $+ 1/2 total spin 
along the chain and thus only affect AR/R and the charge distribution. 
The collective behaviour of these photo-induced ferromagnetic ‘defects’ 
is affected by a small magnetic field, which creates a preferred direc- 
tion through spin canting”, resulting in non-zero collective spin and 
enhanced interchain hopping. Such a B-field can also facilitate percola- 
tion of the photo-induced magnetic clusters, establishing macroscopic 
ferromagnetic order above a critical I,, as the local ferromagnetic cluster 
size grows nonlinearly. Our predicted femtosecond quantum spin 
canting/flip also gives rise to interchain conductive paths, which may 
explain Ix; in the femtosecond-resolved AR/R (Fig. 4b). 


METHODS SUMMARY 


Sample growth and characterization. Single crystals of Pro 7Cap3MnO; were 
grown by the floating-zone method in flowing oxygen with a growth rate of 3 mm 
h”'. X-ray powder diffraction on pulverized single crystals confirmed the single 
phase. The phase diagram in Fig. 1b was constructed by measuring the temper- 
ature- and field-dependence of magnetic susceptibility and electrical resistivity of 
our samples. Strong magnetic fields melt the AFM/CO/OO insulator ground state 
and induce a transition to a ferromagnetic metallic state, for example, at a critical 
magnetic field of B=5T at T~75K. This cannot be accessed by heating the 
sample. For our ultrafast measurements, we placed the sample in a weak magnetic 
field of no more than 0.5 T, pointing perpendicular to the sample surface. 
Ultrafast magneto-optical spectroscopy. A Ti:sapphire amplifier with centre 
wavelength of 800 nm (1.55 eV), pulse duration of 82 fs at the sample position, 
and 1 kHz repetition rate was separated into pump and probe beams. The probe 
was frequency-doubled to 400 nm (3.1 eV). This photon energy produces a large 
magneto-optical activity due to optical transitions between oxygen 2p and Mn fy, 
states”, and thereby mostly measures dynamics of ty, local spins. The probe beam 
was kept nearly perpendicular to the sample surface (polar MOKE geometry) (~7° 
from the normal). To a first approximation, the magneto-optical signal is propor- 
tional to k-S, where k is the vector of the probe light. Thus, the signal measured 
reflects the out-of-plane magnetization component S, of tz. local spins. Our pump 
beam is linearly polarized, with a tunable excitation fluence of up to ~6 mJcm *. 
There is no obvious pump polarization dependence. This two-colour pump-probe 
geometry was shown to maximize the magneto-optical response while minimizing 
the contamination of magneto-optical signals during femtosecond timescales, for 
example, by dichroic bleaching, spectral weight transfer associated with the elec- 
tronic phase transitions, and so on. More information on data analysis and theor- 
etical modelling that solves the time-dependent equations of motion using the 
Runge-Kutta routines is given in Supplementary Information. 
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Three-dimensional imaging of dislocations in a 
nanoparticle at atomic resolution 


Chien-Chun Chen’?*, Chun Zhu>?*, Edward R. Whiteb?, Chin-Yi Chiu?*, M. C. Scott>?, B. C. Regan’, Laurence D. Marks’, 


Yu Huang’? & Jianwei Miao’? 


Dislocations and their interactions strongly influence many mater- 
ial properties, ranging from the strength of metals and alloys to the 
efficiency of light-emitting diodes and laser diodes'*. Several 
experimental methods can be used to visualize dislocations. Trans- 
mission electron microscopy (TEM) has long been used to image 
dislocations in materials*°, and high-resolution electron micro- 
scopy can reveal dislocation core structures in high detail", par- 
ticularly in annular dark-field mode''. A TEM image, however, 
represents a two-dimensional projection of a three-dimensional 
(3D) object (although stereo TEM provides limited information 
about 3D dislocations*). X-ray topography can image dislocations 
in three dimensions, but with reduced resolution’”. Using weak- 
beam dark-field TEM” and scanning TEM", electron tomography 
has been used to image 3D dislocations at a resolution of about 
five nanometres (refs 15, 16). Atom probe tomography can offer 
higher-resolution 3D characterization of dislocations, but requires 
needle-shaped samples and can detect only about 60 per cent of the 
atoms in a sample’’. Here we report 3D imaging of dislocations in 
materials at atomic resolution by electron tomography. By apply- 
ing 3D Fourier filtering together with equal-slope tomographic 
reconstruction, we observe nearly all the atoms in a multiply 
twinned platinum nanoparticle. We observed atomic steps at 3D 
twin boundaries and imaged the 3D core structure of edge and screw 
dislocations at atomic resolution. These dislocations and the atomic 
steps at the twin boundaries, which appear to be stress-relief 
mechanisms, are not visible in conventional two-dimensional pro- 
jections. The ability to image 3D disordered structures such as dis- 
locations at atomic resolution is expected to find applications in 
materials science, nanoscience, solid-state physics and chemistry. 

Crystallographic techniques are currently the primary means for 
determining the 3D atomic structures of crystals. They determine ato- 
mic positions by averaging over many unit cells. As a complement to cry- 
stallographic techniques, electron tomography is an ideal technique for 
imaging the 3D local structure of materials at high resolution’*’°"**°. 
By combining annular dark-field scanning TEM with the centre-of-mass 
(CM) and equally sloped tomography (EST) methods, electron tomo- 
graphy has recently achieved a resolution of 2.4 A (ref. 20) However, 
dynamical scattering effects*', the missing-wedge problem'*'*”° and 
Poisson noise in the tilt series introduce noise in the EST reconstruction. 
Consequently, although lattice structure and some individual atoms 
are visible in the reconstruction”, electron tomography has not been 
able to reveal 3D dislocations in materials at atomic resolution. Here we 
overcome this obstacle by combining 3D Fourier filtering with high- 
angle annular dark-field scanning TEM (HAADF-STEM) tomography, 
and achieve 3D imaging of dislocations in a nanoparticle at atomic 
resolution. 

Platinum (Pt) nanoparticles were synthesized by peptide sequences 
in aqueous solution (Methods). To make the nanoparticles more 


stable under an electron beam, a thin (~1-2nm) carbon layer was 
deposited on the nanoparticles (Methods). Using HAADF-STEM™, we 
acquired a tilt series of projections from a Pt nanoparticle (Methods). 
Supplementary Figs 1 and 2 show the tilt series of 104 projections with 
equal-slope increments and a tilt range of =72.6°. To monitor beam- 
induced changes to the Pt nanoparticle, three 0° projections were mea- 
sured while acquiring the tilt series (Supplementary Fig. 3). The consist- 
ency of these projections indicates that the lattice structure of the 
nanoparticle was stable throughout the experiment. 

After performing background subtraction and CM alignment 
(Methods), the tilt series was reconstructed by the EST method”*”* °° 
(Methods). Figure 1a shows the 3D Fourier transform of the recon- 
struction and Fig. 1c shows a 2.6-A-thick central slice in the x-y plane, 


Figure 1 | 3D reconstruction of a multiply twinned Pt nanoparticle before 
and after applying a 3D Fourier filter. a, 3D Fourier transform of the raw 
reconstruction of the nanoparticle. b, 3D Fourier transform of the 
reconstruction after 3D Fourier filtering where the {111} and {200} Bragg peaks 
are labelled with red and black dots, respectively. c, A 2.6-A-thick central slice in 
the x-y plane of the raw reconstruction, where the z axis is along the beam 
direction. d, The same slice of the 3D structure after applying a 3D Fourier filter, 
in which nearly all the atoms (in white) are visible. The clear boundary of the 
nanoparticle is due to the multiplication of the 3D structure with a 3D shape 
obtained from the EST reconstruction (Methods). The insets show an enlarged 
region of the atomic positions before and after applying a 3D Fourier filter. 
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where the electron beam is along the z axis. However, owing to the low 
signal-to-noise ratio (SNR) in the EST reconstruction, 3D dislocations 
within the nanoparticle cannot be identified at atomic resolution. To 
enhance the SNR of the reconstruction, we developed a 3D Fourier 
filtering method to identify all the measurable 3D Bragg peaks and the 
3D distribution around each peak (Methods). Figure 1b shows the 3D 
Fourier transform of the reconstruction after 3D Fourier filtering, in 
which the red and black dots indicate the {111} and {200} peaks of the 
Pt nanoparticle, respectively. By applying an inverse Fourier transform 
to Fig. 1b and multiplying it by the 3D shape of the Pt nanoparticle 
determined from the EST reconstruction, we obtained the 3D structure 
of the nanoparticle with a size of ~11.2 X 10.7 X 7.1 nm. Supplemen- 
tary Video 1 and Fig. 1d show 3D volume renderings and a 2.6-A-thick 
central slice in the x-y plane of the Pt nanoparticle, in which nearly all 
the atoms are visible. 

To verify the 3D Fourier filtering method, we performed a compar- 
ison with a 3D Wiener filter using the same experimental data. The 
Wiener filter is well peplihed for reducing the noise in a signal, and it 
is applied to TEM images”’. Supplementary Fig. 5b-d shows the 2.6-A- 
thick central slice in the x-y plane of the reconstruction after applying 
the 3D Wiener filter S?/(S” + An), where S is an estimate of the signal, 
n is the noise and A is a parameter that controls the filtering strength 
(larger values of A give stronger filtering). Supplementary Fig. 5f-j 
shows enlarged views of four regions for the raw reconstruction, and 
the reconstructions with the 3D Wiener filter (2 = 1, 2 and 3) and the 
3D Fourier filter. Although the result with the 2 = 1 Wiener filter is 
nosier, the atomic positions in the reconstructions using the 2 = 2 and 
3 Wiener filters and the 3D Fourier filter are consistent. 


Figure 2 | Grain boundary comparisons between a 2D experimental 
projection and several 2.6-A-thick internal slices of the reconstructed Pt 
nanoparticle. a, Experimental projection in the x-y plane suggesting that this 
is a decahedral multiply twinned nanoparticle and that the twin boundaries 
(red lines) are flat. Blue lines show two subgrain boundaries. To enhance the 
image contrast, a 2D Fourier filter was applied to the projection. b, A 2.6-A- 
thick internal slice indicating the existence of atomic steps at the twin 
boundaries (red lines). The subgrain boundaries (blue lines) are two lattice 
spacings wider than those in a. c, Enlarged view of a twin boundary in b. d and 
e, a 2.6-A-thick slice above and below the slice of ¢, revealing that the atomic 
steps vary in consecutive atomic layers. f, Enlarged view ofa stacking fault in the 
2.6-A-thick internal slice, which is in good agreement with the classical model 
for a face-centred-cubic extrinsic stacking fault (inset). These images, as well as 
those in Figs 3 and 4, are displayed with Amira. 
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To further examine the 3D Fourier filtering method, we per- 
formed numerical simulations on a 7.3 X 7.0 X 4.5nm? decahedral 
Pt nanoparticle with multislice calculations. The Pt particle consists 
of 4,015 atoms with edge and screw dislocations (Supplementary Figs 
6a-c, 7a and b). A tilt series of 63 projections with a tilt range of +72.6° 
and equal-slope increments was calculated by performing multislice 
STEM simulations (Supplementary Fig. 8). Two levels of Poisson 
noise were added to the 2 Projections of the tilt series with total electron 
doses of 2.52 10°eA * and 5.67 10*eA *, corresponding to 
Rnoise = 10% and 20%, respectively, where Rnoise is an R-factor used 
to define the level of Poisson noise (Methods). The two tilt series were 
aligned and reconstructed by the CM and EST methods*’?*¢ 
(Methods, Supplementary Fig. 6d-f, j-l). Because of the low SNR in 
the reconstructions, not all the atoms or dislocations are visible in the 
raw 3D reconstructions, especially with Ryoise = 20% (Supplementary 
Fig. 6j-l). After applying a 3D Fourier filter to the raw reconstructions 
(Methods), we obtained two 3D structures of the simulated Pt nano- 
particle with all the atoms resolved (Supplementary Fig. 6g-i, m-o). 
Furthermore, 3D grain boundaries, as well as the 3D core structures of 
edge and screw dislocations were determined at atomic resolution and 
are consistent with those in the model (Supplementary Figs 6g-i, m-o 
and 7c-f). 

After verifying 3D Fourier filtering with a 3D Wiener filter and 
multislice simulation data, we analysed 3D dislocations of the Pt nano- 
particle obtained from the experimental tilt series. Figure 2 shows grain 
boundary comparisons between a 2D experimental projection and 2.6- 
A-thick internal slices of the reconstructed particle. The experimental 
projection in the x-y plane suggests that this is a decahedral multiply 


Figure 3 | Observation of the 3D core structure of an edge dislocation at 
atomic resolution. a, A 7.9-A-thick internal slice of the nanoparticle. The 
lattice structure on the left and at the bottom parts of the slice is not well defined, 
mainly because this decahedral multiply twinned nanoparticle consists of five 
grains with different orientations. b, An enlarged view of an edge dislocation in 
a where red dots represent the position of the atoms. c, d and e, 2.6-A-thick 
atomic layers sectioning through the slice of b. The three consecutive atomic 
layers indicate the dislocation line is in the direction of [101]. The Burgers vector 


(b) of the edge dislocation was determined to be a [101]. 
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twinned nanoparticle** with flat twin boundaries (Fig. 2a and 
Supplementary Fig. 9). However, a 2.6-A-thick internal slice in the 
x-y plane and an enlarged view indicate the existence of atomic steps 
at the twin boundaries (Fig. 2b, c) that are hidden in the projection 
(Fig. 2a). Figure 2d and e shows enlarged views of a twin boundary ina 
2.6-A-thick slice above and below the slice of Fig. 2b, revealing that the 
atomic steps vary in consecutive atomic layers. These atomic steps are 
also independently verified by applying 3D Wiener filtering to the 
same experimental data (Supplementary Fig. 10). In addition, subgrain 
boundaries in the 2.6-A-thick internal slice (Fig. 2b) are two lattice 

spacings wider than those in the projection (Fig. 2a). Figure 2f shows 
an enlarged view of a stacking fault in the 2.6-A-thick internal slice 
ending at a twin boundary, which agrees well with the classical model 
for a face-centred-cubic extrinsic stacking fault' (inset in Fig. 2f). 

In addition to twin boundaries, subgrain boundaries and stacking 
faults, we observed the 3D core structure of edge and screw disloca- 
tions at atomic resolution in the Pt nanoparticle. Figure 3a and b shows 
a 7.9-A-thick internal slice of the nanoparticle and an enlarged view of 
an edge dislocation, where the red dots indicate the atomic positions. 
By computationally ‘sectioning’ the 7.9-A-thick slice, we obtained 
three consecutive atomic layers, each 2.6-A thick (Fig. 3c—e). The three 
consecutive atomic layers indicate that the dislocation line is in the 
[101] direction, and the Burgers vector of the edge dislocation was 


determined to be = [101] (Fig. 3c). To visualize a screw dislocation, a 


5.3-A-thick slice (two atomic layers) in the (111) plane was selected 
(Supplementary Fig. 11b) and then tilted to the [011] direction 
(Fig. 4a). Figure 4b shows an enlarged view of the slice where the zigzag 
pattern, a characteristic feature of a screw dislocation, is visible. To 
better visualize the screw dislocation, we display surface renderings of 
the enlarged region (Fig. 4c), where the atoms indicated by green dots 
are in the top layer and those indicated by red dots are in the bottom 
layer. The zigzag pattern is more clearly visualized in the surface ren- 
derings, in which the green line connects the atoms in the top layer and 
the red line connects the atoms in the bottom layer. The Burgers vector 


Figure 4 | Observation of the 3D core structure of a screw dislocation at 
atomic resolution. a, Volume renderings of a 5,3-A-thick slice (two atomic 
layers) in the (111) plane (Supplementary Fig. 11b), tilted to the [011] direction 
to visualize the zigzag pattern, a characteristic feature of a screw dislocation. 
b, Enlarged view of a screw dislocation showing the zigzag pattern. c, Surface 
renderings of the screw dislocation where the atoms represented by green dots 
are in the top layer and those by red dots are in the bottom layer. The zigzag 
pattern is more clearly visualized, the Burgers vector (b) of the screw dislocation 


was determined to be > [011], and the width of the screw dislocation was 


estimated to be ~8.9 A. 
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of the screw dislocation was determined to be — 5l0 11], and the width of 


the screw dislocation was estimated to be ~8. 9 A, which is consistent 
with the results obtained by combining high-resolution TEM with 
image simulations for Au and Ir (ref. 29). 

Careful analysis of the position of the screw dislocation inside the Pt 
nanoparticle suggests that the screw dislocation is associated with 
atomic steps at a twin boundary (Supplementary Fig. 11). Although 
it is often thought that nanoparticles cannot support dislocations, this 
is not the case for multiply twinned particles such as the decahedral 
one imaged here. These contain a ~2% angular strain and a disclina- 
tion; at larger sizes this can in principle be relieved by dislocations”. A 
recent analysis implied that about one-third of this strain is accom- 
modated at the twin boundaries”, but as a 2D projection method was 
used in this analysis, it did not provide further insight. Our results 
strongly suggest that the twin boundaries are not flat and that disloca- 
tions associated with atomic steps at the boundaries account for the 
strain relaxation (Supplementary Fig. 11). 

The significance of the present work is twofold. First, 3D atomic- 
resolution imaging of dislocations allows us to observe new structures 
that are not visible in conventional 2D projections; this is expected to 
advance our fundamental understanding of dislocations in materials. 
Second, although discrete tomography through the use of a priori 
information has been applied to reconstruct 3D surface morphology 
of a small crystalline nanoparticle at atomic resolution*', EST-based 
electron tomography in combination with 3D Fourier filtering repre- 
sents a general method for 3D atomic resolution imaging of the local 
structure in nanomaterials. Although nanoparticles are used in this 
study, this method could, in principle, be applied to 3D imaging of thin 
materials at atomic resolution; the sample thickness is limited only by 
dynamical electron scattering. 


METHODS SUMMARY 


Pt nanoparticles were synthesized by using peptide sequences in aqueous solution 
at room temperature” and supported on 5-nm-thick silicon nitride membranes. 
To stabilize the nanoparticles under a STEM beam, a thin (~1-2 nm) carbon layer 
was deposited on the Pt nanoparticles and the electron energy was kept below the 
knock-on radiation damage threshold of Pt. Using HAADF-STEM (energy: 
200keV; spherical aberration: 1.2mm; illumination semi-angle: 10.7 mrad; 
detector inner and outer angles: 35.2 mrad and 212.3 mrad, respectively), a tilt 
series of 104 projections was acquired from a Pt nanoparticle with equal-slope 
increments” and a tilt range of +72.6°. To monitor beam-induced changes to the 
nanoparticle, three 0° projections were measured during the acquisition of the tilt 
series (Supplementary Fig. 3); they indicated that the Pt nanoparticle was stable 
throughout the experiment. After performing background subtraction and CM 
alignment for the tilt series, a 3D reconstruction of the nanoparticle was obtained 
using the EST method”*****. However, 3D dislocations of the nanoparticle cannot 
be identified in the raw 3D reconstruction at atomic resolution. To enhance the 
SNR of the reconstruction, we developed a 3D Fourier filtering method to identify 
all the measurable 3D Bragg peaks and the 3D distribution around each peak. The 
3D Fourier transform of the EST reconstruction consists of {111} and {200} Bragg 
peaks. We first determined a two-shell volume with all the measurable Bragg peaks 
and set other voxels to zero. We then chose the most intense {111} Bragg peak as a 
reference peak. After optimizing the threshold based on the reference peak 
(Supplementary Fig. 4) and performing convolution with a three-voxel-diameter 
sphere, we identified all the measurable peaks and the 3D distribution around each 
peak (Fig. 1b). After applying the inverse Fourier transform to Fig. 1b, the 3D 
structure of the Pt nanoparticle was obtained in which nearly all the atoms are 
visible. 


Full Methods and any associated references are available in the online version of 
the paper. 
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METHODS 


Definition of R,oise. An R-factor was used to define the level of Poisson noise in 
each multislice simulation tilt series, 


> | Prise (XY) _ Po tated (x,y)| 
xy 


0 
»~ Pimmutated (%Y) 
xy 


0 


noise 


(1) 


where Pe slate (x,y) is the projection calculated from multislice STEM simulations 
at angle 0, and p! 


noise 
computing Rie 
the projections. 
Synthesis of Pt nanoparticles. The Pt nanoparticles were synthesized by peptides 
in aqueous solution at room temperature, as reported previously”. All reagents 
were dissolved in water before using. A pre-prepared vial containing precursor 
(chloroplatinic acid hydrate (H2Pt(Iv)Cls*xH,O, 1 mM) and S7 peptide (Ser-Ser- 
Phe-Pro-Gln-Pro-Asn) solution (30 mg ml!) were mixed with ascorbic acid 
(2mM) immediately before injecting fresh NaBH, (0.8 mM), where NaBH, and 
ascorbic acid were used as reducing agents. The final volume of the reaction 
solution was 5 ml, and the reaction normally required more than 30 min. 
Sample preparation. Pt nanoparticles were deposited on a 5-nm-thick silicon 
nitride membrane. The membrane, with a size of 100 um X 1,500 jum, is supported 
on a 100-j1m-thick silicon frame (http://TEMwindows.com). To dissipate charge 
efficiently and make the nanoparticles more stable under an electron beam, a high- 
temperature ultrathin carbon coating (http://TEMwindows.com) was applied to 
the nanoparticles based on the following procedure. The silicon nitride membrane 
grid was first placed in a vacuum chamber and the temperature was increased from 
300 °C to 700 °C ata rate of 10°C s_*. The carbon was coated during a 5-min soak 
at 700 °C. The chamber then naturally cooled to 450 °C over the next 5 min before 
removing the grid. The silicon nitride membrane grid was finally loaded on a 
tomographic sample holder (Fischione 2020) for data acquisition. 
Acquisition of tomographic tilt series using HAADF-STEM. STEM images of 
the Pt nanoparticles were acquired on a FEI Titan 80-300 microscope (energy: 
200 keV; spherical aberration: 1.2 mm; illumination semi-angle, 10.7 mrad). The 
100-pA electron beam was focused to a probe with a 50-tm probe-forming aper- 
ture (C2) and rastered over the sample. The scattered electrons were captured by a 
Fischione 3000 HAADF detector with angles between 35.2 mrad and 212.3 mrad 
relative to the optical axis. HAADF angles were used to reduce the nonlinear 
intensities and the diffraction contrast in the images. The maximum tilt angles 
were limited by the holder to +75°. To reduce vibration and drift during data 
acquisition, the sample holder was allowed to settle for 1 h after inserting into the 
microscope and also for several minutes after moving to each new angle. Tilt series 
were manually acquired by changing the angle with equal-slope increments”’”***. 
When focusing an image, a nearby nanoparticle (generally within 20 nm along the 
rotation axis) was first viewed, thus reducing unnecessary radiation dose to the 
particle under study”®. Using this low-exposure acquisition scheme, a tomographic 
tilt series of 104 projections with equal-slope increments and a tilt range of +72.6° 
was acquired from a Pt nanoparticle. The probe current was ~100 pA with a dwell 
time of 48 [ls per pixel, and the magnification of each projection was 3.6 X 10°. The 
total electron dose of the tilt series was estimated to be ~2.5 X 10’e A. Because 
the pixel size in STEM mode may vary, a calibration image of an oriented single- 
crystal Au foil (Ted Pella) was taken in STEM mode under the same conditions, 
and the STEM pixel size was calibrated to be 0.35 A. To enhance the SNR in the 
projections of the tilt series, 1.5 X 1.5 pixel binning was performed for each pro- 
jection. The pixel size of the binned projections is 0.53 A. 
Background subtraction and CM alignment. To use the EST iterative algorithm, 
background subtraction has to be performed for each projection. We implemented 
background subtraction and CM alignment based on the following procedure. 
First, we selected the 0° projection and chose a suitable cut-off value to subtract 
its background. After projecting the 0° projection onto the y axis (that is, the tilt 
axis) to obtain a one-dimensional (1D) curve, we calculated the centre of mass of 
the 1D curve (yc), and set yoy as the origin of the y axis. This 1D curve was used 
as a reference curve. Second, we optimized the background subtraction for other 
projections based on the reference curve. We scanned the cut-off values from 0.5 to 
1.5 times the mean value of each projection with an increment of 0.01. For each cut- 
off value, we projected the projection onto the yaxis to obtain a 1D curve. By 
shifting the 1D curve pixel-by-pixel along the y axis, we calculated the difference 
between the 1D curve and the reference curve. We recorded the smallest difference 
and the corresponding shift for each 1D curve. After scanning through all the cut- 
off values, we plotted the smallest differences against the cut-off values, which 
should give a U-shaped curve. We identified the minimum corresponding to the 
optimized cut-off value and the shift of a given projection. After performing 
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background subtraction with the optimized cut-off values, we calculated the centre 
of mass (ye) for all the 1D curves. Ifall of them (that is, rounded yc) were not at 
the origin, we adjusted the cut-off value for the 0° projection (that is, the reference 
curve) and repeated the above steps until the best agreement was achieved. Finally, 
after performing background subtraction and aligning the projections along the 
y axis, we projected all the projections onto the x axis. We calculated the centre of 
mass (Xcq) for all 1D curves, and set xcy as the origin of the xaxis*®. After 
repeating this process, all the projections were aligned to the tilt axis. 

EST reconstruction. The EST iterative algorithm started with converting the 
measured projections to Fourier slices by the fractional Fourier transform*’. The 
algorithm then iterated alternately between real and reciprocal space through the 
use of the pseudopolar fast Fourier transform*’. In real space, the voxel values 
outside a loose support (that is, a rectangular box larger than the true boundary of 
the structure to be reconstructed) and the negative voxel values inside the support 
were set to zero, whereas in reciprocal space the corresponding calculated Fourier 
slices were replaced with the measured ones and the remaining slices kept 
unchanged in each iteration. Each iteration was monitored by an error metric, 
defined as the difference between the measured and calculated Fourier slices, and 
the algorithm was terminated after reaching a maximum number of iterations. 
Using the iterative EST algorithm, a preliminary 3D reconstruction was obtained 
after 500 iterations. An updated 3D support was determined by convolving the 
reconstruction with a Gaussian window and selecting a suitable cut-off. The 3D 
shape of the support was also double checked by examining the reconstruction 
slice-by-slice to ensure the support does not crop the structure. Using the updated 
support, we performed another 500 iterations to obtain a new reconstruction. To 
further improve the 3D reconstruction, we also projected the reconstruction back 
to calculate projections at given angles. By computing the cross-correlation 
between the calculated and measured projections, we further adjusted the align- 
ment of the projections to achieve maximum consistency in 3D reconstruction. 
Usually the shift should be one pixel or smaller in each dimension. Otherwise, the 
data analysis and CM alignment procedure has to be re-done. We then repeated 
the procedure for improving the support and back-projection alignment. The final 
reconstruction was obtained when no further improvements can be made. The 
details of the EST algorithm can be found in refs 20, 23-26. 

3D Fourier filtering. As a result of dynamical scattering effects”’, the missing- 
wedge problem'®'*”° and Poisson noise in the tilt series, not all the atoms or 
dislocations are visible in the raw 3D EST reconstruction. To enhance the SNR 
in the reconstruction, we developed a 3D Fourier filtering method by using the 
following procedure. First, the 3D Fourier transform of the raw reconstruction of 
the Pt nanoparticle consists of two sets of lattice planes {111} and {200}. The 
intensities of the {111} peaks were estimated to be several times higher than those 
of the {200} peaks. We calculated the average radial distance (d) between the {111} 
and {200} peaks. Two radii were then determined by Rj, =R,,,;—d and 
Rout = Rooo + d, where Ri; and Roo are the average radial distance for the 
{111} and {200} peaks, respectively. By keeping those voxels in the 3D Fourier 
transform with their radii between Ri, and Ro. and setting other voxels to zero, we 
obtained a two-shell volume including all the measurable 3D Bragg peaks. 

Next, we implemented a method to further reduce noise among the Bragg peaks 
within the two-shell volume. We chose the most intense {111} Bragg peak as a 
reference peak and calculated thresholds based on the reference peak. We scanned 
the thresholds from 1% to 20% of the reference peak in steps of 1%. For each 
threshold, we set voxels with values larger than the threshold to one and other 
voxels to zero, and obtained a 3D mask. The 3D mask was convolved with a three- 
voxel-diameter sphere to compute a new 3D mask, where the convolution process 
was to retain the 3D distribution of each Bragg peak. By multiplying the new 3D 
mask with the Fourier transform of the raw reconstruction, we obtained a new 3D 
Fourier transform. By monitoring the change in the noise among the Bragg peaks, 
we found that a threshold with 10% of the reference peak is large enough to remove 
noise among the 3D Bragg peaks, while retaining all the measurable {111} and 
{200} peaks and the 3D distribution around each peak (Fig. 1b and Supplementary 
Fig. 4). The optimized threshold of 10% of the reference peak obtained here may 
vary for different samples. 

Finally, by applying the inverse Fourier transform to Fig. 1b and multiplying it 
by a 3D shape (that is, a tight support) obtained from the EST reconstruction, we 
obtained the 3D structure of the Pt nanoparticle (Supplementary Video 1 and 
Fig. 1d). We confirmed the accuracy of the 3D Fourier filtering method by using 
two independent approaches: (1) multislice STEM simulations* of a decahedral Pt 
nanoparticle with edge and screw dislocations (Supplementary Figs 6-8) and (2) 
performing a comparison with a 3D Wiener filter’ on the same experimental data 
(Supplementary Figs 5 and 10). In our numerical simulations, we also found that, 
compared to 2D Fourier filtering method****, 3D Fourier filtering is more accur- 
ate. This is because in 3D Fourier filtering, each voxel in 3D reciprocal space is 
correlated to all voxels in 3D real space, and vice versa. But in 2D Fourier filtering, 
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the correlated information only exists in two dimensions. Thus, for a given object, 
there is more correlated information (voxels) in 3D Fourier filtering than in the 2D 


case. 
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Colossal injection of catalyst atoms into silicon 
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The incorporation of impurities during the growth of nanowires 
from the vapour phase alters their basic properties substantially, and 
this process is critical in an extended range of emerging nanometre- 
scale technologies’*. In particular, achieving precise control of the 
behaviour of group III and group V dopants has been a crucial step in 
the development of silicon (Si) nanowire-based devices’. Recently* 
it has been demonstrated that the use of aluminium (Al) as a growth 
catalyst, instead of the usual gold, also yields an effective p-type dop- 
ing, thereby enabling a novel and efficient route to functionalizing Si 
nanowires. Besides the technological implications, this self-doping 
implies the detachment of Al from the catalyst and its injection into 
the growing nanowire, involving atomic-scale processes that are cru- 
cial for the fundamental understanding of the catalytic assembly of 
nanowires. Here we present an atomic-level, quantitative study of 
this phenomenon of catalyst dissolution by three-dimensional atom- 
by-atom mapping of individual Al-catalysed Si nanowires using 
highly focused ultraviolet-laser-assisted atom-probe tomography. 
Although the observed incorporation of the catalyst atoms into nano- 
wires exceeds by orders of magnitude the equilibrium solid solubility” 
and_ solid-solution concentrations in known non-equilibrium 
processes'*"*, the Al impurities are found to be homogeneously dis- 
tributed in the nanowire and do not form precipitates or clusters. As 
well as the anticipated effect on the electrical properties, this kinetics- 
driven colossal injection also has direct implications for nanowire 
morphology. We discuss the observed strong deviation from equilib- 
rium using a model of solute trapping at step edges, and identify the 
key growth parameters behind this phenomenon on the basis of a 
kinetic model of step-flow growth of nanowires. The control of this 
phenomenon provides opportunities to create a new class of nanoscale 
devices by precisely tailoring the shape and composition of metal- 
catalysed nanowires. 

The nanowires investigated in this study were grown on Si(111) in 
an ultrahigh-vacuum chemical deposition system'®'’. Their growth 
was accomplished with Al-Si nanoparticles, which act catalytically 
as the energetically favoured sites for vapour-phase reactant (SiH,) 
adsorption and (when saturated) as the nucleation sites for crystalli- 
zation and one-dimensional growth’®. The resulting nanowires are 
aligned along the [111] direction and display morphological charac- 
teristics that are sensitive to the growth temperature (Supplementary 
Figs 1 and 2). Growth at the highest temperature (470°C) yields 
tapered nanowires with an average height and base diameter of 
475 + 30nm and 94+ 8nm, respectively (Supplementary Fig. 1). 
Figure la displays a high-resolution transmission electron microscopy 
(TEM) micrograph of an Al-catalysed Si nanowire near the interface 
with the catalyst. This micrograph demonstrates that both nanowire 
and catalyst nanoparticle are single crystals with a coherent interface 
between them. However, the close atomic numbers of }*Al and !Si 
make them indistinguishable in high-resolution TEM images of nano- 
wires. Energy-dispersive X-ray analysis (EDX) in an aberration- 
corrected TEM did not find a significant Al signal in Si nanowires 


grown at the lowest temperature (410 °C). This indicates that the Al 
concentration is less than the EDX detection limit, which is typically 
~0.5 atomic per cent (at.%). For a rigorous analysis, we used atom- 
probe tomography (APT), which has a superior detection sensitivity of 
less than ten atomic parts per million (p.p.m.), with atomic-scale spa- 
tial resolution’’. Pulsed evaporation of individual atoms was achieved 
using a focused picosecond ultraviolet (wavelength 1 = 355 nm) laser. 
The use of a highly focused ultraviolet laser beam decreases local 
heating, thereby improving the detection limits and mass resolving 
power (m/Am, where m is the mass of an isotope)”*. 

Figure 1b shows a three-dimensional atom-by-atom map ofa single 
Al-catalysed Si nanowire covered by a protective nickel (Ni) layer. The 
inset in Fig. 1b displays a cross-sectional TEM image of an identical 
nanowire. Figure 1c shows two projections of an 80-nm-long segment 
ofa Si nanowire delineated by an isoconcentration surface drawn at the 
50 at.% Si level. 

The 50 at.% Al isoconcentration surface of the catalyst nanoparticle 
is displayed in Fig. 2a. Axial and radial concentration profiles of Aland 
Si, taken from the top 10nm of the nanoparticle, are presented in 
Fig. 2b and c, respectively. Both concentration profiles demonstrate 
Si segregation close to the nanoparticle’s surface, possibly as a result of 
Si expulsion during post-growth cooling or simply due to oxidation on 
exposure to air. The average Si concentration measured at the core of 
the nanoparticle is in the range of 2.7 + 0.9 to 3.1 + 0.4 at.%, which is 
nearly twice the solubility of Si in Al at the eutectic temperature’’. This 
indicates that Si solubility in Al increases in undercooled catalyst. 
Figure 2d displays the three-dimensional distribution of Al atoms 
detected in an as-grown Si nanowire. The average Al solid concentra- 
tion ce ) measured for Si nanowires grown at 410 °C is ~(2.0 + 0.5) 
x 10°°atom cm * (~0.4 at.%). Strikingly, this is about four orders of 
magnitude greater than the extrapolated equilibrium solubility of Al in 
Si (ref. 12), which corresponds to an increase of two to three orders of 
magnitude compared to Al-assisted recrystallization’* or solid-phase 
epitaxy'* at a similar temperature. Notwithstanding this colossal Al 
concentration, clusters or precipitates of Al do not form. To verify this 
quantitatively, we statistically analysed the spatial distribution of Al 
atoms (Fig. 2e) and performed partial Al-Al radial distribution func- 
tion analyses. This analysis shows that the concentration frequency 
distribution is not different from a binomial distribution with the same 
mean at 99% confidence, and thus confirms that Al does not form 
precipitates or clusters in Si nanowires. 

Figure 3a displays a set of radial concentration profiles measured in 
8-nm-diameter cylinders in different regions across the nanowire 
grown at 410 °C, perpendicular to the growth direction (Fig. 3a insets). 
The estimated mean x‘, value varies between 0.39 and 0.45 at.%. The 
profiles display random fluctuations indicative of a random distri- 
bution of Al in Si nanowires. This is also consistent with the Al axial 
concentration profiles (Fig. 3b). Indeed, a averaged over a cylin- 
drical volume remains practically the same regardless of the diameter 
of the selected volume, thereby confirming the uniform distribution of 
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Figure 1 | Structure and three-dimensional map of Al-catalysed Si 
nanowires. a, High-resolution cross-sectional TEM image (left) displaying the 
interface between the catalyst particle and the nanowire. The right panel 
exhibits a close-up image of the interface (top), the fast Fourier transform of the 
image (middle), and the corresponding colour-filtered image (bottom, Al and 
Si regions correspond to red and blue regions, respectively) indicating that the 
interface is epitaxial. b, Three-dimensional APT atom-by-atom map of a 


Al. Analysis of x3, in Si nanowires grown at a higher temperature 
(470 °C) demonstrates a significantly stronger incorporation of Al, 
leading to concentrations sufficiently large to be detected by EDX 
(Fig. 3c). Interestingly, the Al radial concentration displays flat pro- 
files, indicating a uniform distribution of Al in the nanowire. Surface 
segregation of Al would have led to ‘U’-shaped profiles. Similarly, the 
Al axial concentration profile indicates a uniform concentration 
(~4.3 at.%) along the nanowire. We note the ~10-fold increase in 
x$, when compared to the nanowires grown at 410 °C. 

It is generally recognized that nanowire tapering is induced by 
vapour-solid growth on sidewall surfaces* and/or catalyst diffusion 
from one nanowire to another”. As demonstrated here, this is not 
the case for Al-catalysed nanowires, for which the observed incorpo- 
ration of catalyst atoms during growth is the key element determining 
nanowire morphology. It is noteworthy that the uniform Al radial 
concentration profile (Fig. 3c) rules out vapour-solid deposition on 
the nanowire sidewalls as a possible mechanism for nanowire tapering. 
Specifically, if homoepitaxy did take place on the nanowire’s sidewalls, 


LETTER 


nanowire grown at 410 °C. For the sake of clarity, only a limited number of 
atoms is displayed (2.5 x 10* atoms of each element). Inset, a cross-sectional 
TEM image of an identical Si nanowire (scale bar, 40 nm). ¢, Si 50 at.% 
isoconcentration surface of an 80-nm-long segment of a nanowire determined 
by analysing a three-dimensional atom-probe tomographic reconstruction: left, 
side view; right, top view. 


the overgrown layer would have been Al-free, because Al diffusion is 
negligible under these growth conditions”’. The strong incorporation 
of Al catalyst atoms into the nanowire that we observe should involve a 
continuous dissolution of the catalyst throughout the growth process, 
with direct implications for nanowire morphology (Supplementary 
Information, section D). For instance, at 470 °C, the total volume of 
Al atoms incorporated in the completely grown Si nanowire is 
~5.5 X 10*nm’, which is significantly larger than the volume of the 
catalyst at the end of growth (~1.3 X 10* nm’). This means that there 
is more Al in the Si nanowire than in the nanoparticle at the end of the 
growth process—that is, the catalyst has lost more than 80% of its 
initial volume. This leads to a continuous reduction of the diameter 
of the catalyst nanoparticle during growth, and thus to nanowire 
tapering. We note, however, that the catalyst’s volume loss is very small 
(~6%) at 410 °C, which explains the absence of tapering at this tem- 
perature (Supplementary Fig. 2). 

The observed uniform Al concentration distributions suggest that the 
dissolution of Al atoms and their injection into the growing nanowire 
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Figure 2 | APT data obtained for a catalyst nanoparticle and a Si nanowire. 
a, Al 50 at.% isoconcentration surface of a catalyst nanoparticle. b, c, Axial 

(b) and radial (c) concentration profiles of Aland Si in the catalyst nanoparticle; 
error bars, +1 s.d. calculated for the analysed volume containing 2.9 X 10° Al 
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atoms and 8 X 10° Siatoms. d, Three-dimensional distribution of Al atoms ina 
Sinanowire grown at 410 °C. e, Results of statistical analysis of Al concentration 
frequency (histogram) and of binomial analysis (solid line). The binomial 
analysis was performed by sampling blocks containing 200 atoms. 
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Figure 3 | Al concentration profiles in individual nanowires. a, Al radial 
concentration profiles (left) measured in different regions of a Si nanowire. 
Each concentration profile is taken from a cylindrical segment having a 
diameter of 8 nm, as indicated in the images (right). b, Al axial concentration 
profiles (left) measured for cylindrical segments (right) with a diameter of 

10 nm (top) and 20 nm (bottom). The horizontal dashed line in all 
concentration profiles denotes the average concentration, which is given as 


occurs at a constant rate during the growth process. Moreover, the fact 
that Al displays uniform radial and axial concentration profiles suggests 
the existence of a single atomic pathway behind this colossal incorpora- 
tion. We anticipate that surface effects, capillarity and related nanoscale 
stresses will shift the boundaries in the phase diagram of the catalyst- 
nanowire system in comparison to those of a bulk system’*. However, by 
taking into account these thermodynamic considerations and including 
the nanowire tapering (Supplementary Information, section E), we 
found that the calculated solid composition of the nanowire is always 
smaller than the equilibrium content, which is of the order of 1 atomic 
p-p.m. at the growth temperature of our Si nanowires (Supplementary 
Fig. 11). This stands in sharp contrast to the observed excess concentra- 
tions (that is, concentrations above the equilibrium value) of orders of 
magnitude in our Si nanowires, which demonstrates that kinetic factors 
must be responsible for the incorporation of Al atoms. 
Supersaturations of several orders of magnitude occur as a result of 
deviations from local equilibrium in ultra-rapid solidification of a 
melt**. This phenomenon (so-called solute trapping) has been 
observed for a variety of solutes in Si for growth rates of the order of 
metres per second (refs 24, 25). In general, solute trapping is quantified 
in terms of a partition coefficient, k, which is the ratio of the solute 
concentration in the solvent to its concentration in the melt at the solid— 
liquid interface, as a function of the equilibrium partition coefficient 
(k.), the average interface velocity (Vin), and a characteristic velocity 
known as diffusive velocity (vp)*°. Because nanowires grow by succes- 
sive addition of bilayers through step flow’’, the average interface velo- 
city is defined as v,,i, = b/t, where b = 0.3 nm is the thickness of a 
Si(111) bilayer and 1 is the total time needed for one bilayer to grow 
(t includes the time needed to reach the supersaturation (the incubation 
time, T;) and the time between the end of the incubation period (that is, 
the beginning of nucleation) and the complete growth of the bilayer, 
Tp). Note that the incubation time is a nanoscale phenomenon, not 
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blue numerals; the shaded band is the uncertainty interval; and error bars in 
a and b represent +1s.d. The number of Al atoms in the sampling volumes 
varies between ~2,000 and 4,500. c, EDX Al concentration profiles: left and 
middle plots, radial concentration profiles near the base and middle of the Si 
nanowire, respectively; right plot, axial concentration profile. Error bars, 
approximately +0.5 at.%, which is the sensitivity of the instrument. Insets, the 
corresponding TEM images with EDX scan lines (dashed lines). 


expected for macroscopic systems”. Under our growth conditions, Vmin 
equals 0.4 and 0.2nms 1 at 470 and 410°C, respectively. Intriguingly, 
from studies of rapid solidification of melts*’, one can infer that solute 
trapping should not occur for systems with an interface velocity of the 
order of nanometres per second, and thus the anticipated amount of Al 
in Si nanowires should not exceed the equilibrium concentration. We 
note that the influence of temperature on the rate of the thermally 
activated jumps is not sufficient to explain the observed incorporation 
of Al at exceedingly low interface velocities (Supplementary Informa- 
tion, section F). 

Current solute trapping models” involve the assumption that 
atomic jumps occur over the entire infinite liquid—solid interface, 
and solute atoms are trapped only after the passage of the next layer 
across the nanowire surface. Although this mechanistic picture 
describes the behaviour in bulk systems very well”*, it cannot be 
extended to nanowire growth, which is characterized by a relatively 
large delay between two successive bilayers. During this incubation 
time, there is no driving force for atoms to leave the catalyst and thus 
the exchange (that is, the process of atomic jumps) should stop until 
the system reaches supersaturation. It is also important to note that in 
step-flow growth, atoms at the step edges have few nearest neighbours, 
which provides them with more steric freedom for an exchange pro- 
cess. Thus, it is reasonable to assume that atomic jumps are more 
favourable at the step edges than at terraces. This suggests that the 
impurity atoms become frozen in the solid immediately after the forma- 
tion of the next row of atoms at the step edge. Therefore, the time, t,, 
during which the local exchange is possible corresponds to the average 
time needed to add one row, and can be expressed as Te = 4/Vstep, Where 
a is equivalent to the width of one row of atoms and Vstep is the average 
step velocity. This latter can be expressed as a function of nanowire 
radius Ryw: Vstep = 2Rnw/tpr. Thus, at fixed radius, t. = tp, (a/ 
2Rnw) = Tpi/n, where n= 2Rnw/a> 1. The parameter n depends 
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only on step speed (Supplementary Information, section F). Experi- 
mentally, we can only determine the average time between the comple- 
tion of one bilayer and the completion of the next, t = 1; + tp.. By 
taking f= tp,/t, the exchange time can be written as 1, = (f/n)t. 

The transport of impurity atoms between the two sides of the inter- 
face for 0 < t < t, can be described on the basis of chemical rate theory 
and mass conservation’’”*. Assuming that the step growth is random 
rather than periodic” (Supplementary Information section F), the 
partition coefficient can be expressed as: 
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where D; is the coefficient of interdiffusion at the interface (vp = D,/b). 
Figure 4a displays the evolution of the calculated k in (fn) space for 
nanowires grown at 470°C (t=0.75s and k,=2.6 X10 °) using 
Vp =6.7nms | obtained from earlier solute trapping models (Sup- 
plementary Information section F). We note that any pair (fn) (except 
around (1;1)) predicts Al trapping at levels always significantly higher 
than the measured value (k = 5.6 X 10°”). At these trapping levels, 
nanowire growth cannot occur. Obviously, diffusive velocities of the 
order of nanometres per second cannot describe the observed Al injec- 
tion in nanowires. The complete map of Vp values fitting the measured k 
values is displayed in Fig. 4b. Note that the expression for tpr can be 
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Figure 4 | Calculations of partition coefficient and diffusive velocity. a, The 
calculated partition coefficient, k (on colour scale and labelled on contours), for 
nanowires grown at 470°C using a diffusive velocity (vp) of 6.7nms ' 
obtained from an early solute trapping model”. Axes show f (given by tp,/t) 
and n (given by tp1/Te); see text for details. Inset, the calculated k near 

(fn) = (151). b, The map in (fn) space of diffusive velocities, vp (on colour scale 
and labelled on contours), fitting the measured partition coefficient at 470 °C 
(k = 5.6 X 10 ”). It is important to indicate that only low values of f have a 
physical meaning in nanowire growth as the incubation time is much larger 
than the growth time of a single bilayer (that is, tg, < 1), corresponding to 
f<1). Note the logarithmic scales. In both a and b, Vin is taken as 0.4nm sy 
which is the average interface velocity corresponding to nanowires grown at 
470°C. 
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deduced from the model above: tgp~(2RnNw/Vp)[(1 —k)/(k—ke)]. 
The extrapolation of the correlation between the equilibrium partition 
coefficient, k,, and vp obtained in laser resolidification”’ to our experi- 
mental conditions yields a diffusive velocity of ~2 X 10''nms | at 
470°C (ke = 2.6 X 10 °), corresponding to Tp, ~ 7 ns. Interestingly, 
the time between two layers is ~0.75 s, thus suggesting that the incuba- 
tion time is very much larger than the time needed for the step to grow 
across the whole nanowire diameter, in qualitative agreement with in 
situ TEM observations*”°”?”°. Similarly, at the lowest growth temper- 
ature (k.=6.4X 10 ”), the estimated tpy, is ~18ns. A reduction in 
growth temperature is anticipated to affect not only the rate of the ther- 
mally activated atomic jumps, but also other growth parameters that are 
critical for the incorporation of the catalyst atoms, as discussed below. 

Using a step-flow growth model that takes catalyst dissolution into 
consideration, we derived an expression for the Al content in the 
nanowire as a function of growth parameters (Supplementary Inform- 
ation section G). Note, however, that the elucidation of the kinetically 
controlled atomic processes involved in the incorporation of Al would 
require a deep understanding of Al-induced Si surface reconstructions 
(Supplementary Information section H). From our analysis, it emerges 
that the second derivative of the Gibbs free energy of the liquid evalu- 
ated at the equilibrium composition, g’, and the characteristic super- 
saturation at which the nucleation occurs deterministically, u., are the 
two important parameters that influence catalyst injection. The quan- 
tity gi’ can be modified by adjusting Si solubility, whereas high y, 
implies a high nucleation barrier. In principle, both parameters can 
be engineered through control of the catalyst chemistry (by adding 
impurities, for instance), thus providing the possibility of tailoring the 
properties of nanowires. 

This flexibility in fabrication and functionalization of nanowires 
could be greatly enhanced owing to the availability of other catalysts 
that can act as dopants—for example, Bi (which yields n-type doping”'””) 
or Ga (which gives p-type doping**)—if their injection during the 
growth of Si or other group IV nanowires can be controlled. Our 
observations and predictions provide motivation to pursue the synthesis 
and characterization of atomically controlled nanowires with potentially 
adjustable morphology and physical characteristics that might offer 
new routes for catalytic assembly of nanowire-based devices. 


METHODS SUMMARY 


The growth of Al-catalysed Si nanowires was accomplished by using monosilane, 
SiH4, (diluted to 5% in argon) as a precursor in the temperature range ~400-470 °C 
(refs 10, 15). The partial pressure of the monosilane was held below 0.15 mbar 
during growth. Although the temperature used is below the Al-Si macroscopic 
eutectic temperature, the growth here is believed to be via the vapour-liquid-solid 
mechanism involving undercooled Al-Si nanodroplets. The morphology of Al-Si 
nanowires was characterized using an FEI dual-beam Nanolab 600 scanning elec- 
tron microscope, a Philips CM 20T TEM operating at 200 kV, a JEOL JEM-4010 
TEM operating at 400 kV, and an aberration-corrected FEI TITAN 80-300 analy- 
tical scanning TEM operating at 300 kV, which yields a spatial resolution of about 
100 pm in both modes (TEM and scanning TEM). Additionally, this TEM is also 
equipped with an EDX detector having a detection limit of about 0.5 at.%. We use an 
ultraviolet laser-assisted LEAP (local electrode atom probe) tomograph (LEAP 
4000XSi, Cameca). The three-dimensional reconstructions and statistical analysis 
of concentration frequency were performed using Cameca’s IVAS program. For the 
preparation of nanowire LEAP specimens, we have developed a focused ion beam 
(FIB)-based damage-free methodology to attach individual nanowires to commer- 
cially available Si microtips (Supplementary Information section B). The Si micro- 
tips were subsequently inserted into the LEAP tomograph’s ultrahigh-vacuum 
chamber and cooled to 60K before pulsed ultraviolet-laser-assisted evaporation 
analyses. To confirm that the preparation and laser-assisted evaporation do not 
affect the accuracy of the analysis, a reference sample consisting of an Al-capped Si 
needle was also analysed under exactly the same conditions, and the results demon- 
strate that the needle is Al-free. The binomial analysis was performed by sampling 
blocks of 200 atoms. The entire volume was first divided into columns along the z 
axis, with a cross-section in x- and y- directions, so that the volume dxdydz contains 
200 atoms on average. To obtain blocks with exactly 200 atoms, each column with 
cross-section dxdy was then cut into sections with 200 atoms as dz is varied. 
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July 2012 Greenland melt extent enhanced by 


low-level liquid clouds 


R. Bennartz', M. D. Shupe?, D. D. Turner’, V. P. Walden’, K. Steffen”, C. J. Cox*, M. S. Kulie®, N. B. Miller® & C. Pettersen® 


Melting of the world’s major ice sheets can affect human and envir- 
onmental conditions by contributing to sea-level rise. In July 2012, 
an historically rare period of extended surface melting was observed 
across almost the entire Greenland ice sheet'”, raising questions 
about the frequency and spatial extent of such events. Here we show 
that low-level clouds consisting of liquid water droplets (‘liquid 
clouds’), via their radiative effects, played a key part in this melt 
event by increasing near-surface temperatures. We used a suite of 
surface-based observations’, remote sensing data, and a surface 
energy-balance model. At the critical surface melt time, the clouds 
were optically thick enough and low enough to enhance the down- 
welling infrared flux at the surface. At the same time they were 
optically thin enough to allow sufficient solar radiation to penetrate 
through them and raise surface temperatures above the melting 
point. Outside this narrow range in cloud optical thickness, the 
radiative contribution to the surface energy budget would have been 
diminished, and the spatial extent of this melting event would have 
been smaller. We further show that these thin, low-level liquid clouds 
occur frequently, both over Greenland and across the Arctic, being 
present around 30-50 per cent of the time*°. Our results may help to 
explain the difficulties that global climate models have in simulating 
the Arctic surface energy budget’ ’, particularly as models tend to 
under-predict the formation of optically thin liquid clouds at super- 
cooled temperatures°—a process potentially necessary to account 
fully for temperature feedbacks in a warming Arctic climate. 

Over the past few decades, the Arctic has experienced warming amp- 
lified by a set of positive feedbacks; these feedbacks include increased 
sea ice melt, increased atmospheric water vapour and cloudiness, and 
changes in atmospheric circulation patterns"’. This warming has resulted 
in an increased extent of surface melt of the Greenland ice sheet (GIS) 
observed by satellite since 1979’". July 2012 set a new record in melt 
extent, with melting observed over nearly the entire GIS'”. At Summit 
Station, a brief melting period was observed on 11 July 2012 (Fig. 1). Ice- 
core records from the same location indicate such events occur only 
about once every 150 yr (ref. 1) on average, with the last occurring in 
1889". The July 2012 melt event was triggered by advection of unusually 
warm air, with temperatures at 500 m above ground only slightly below 
freezing (Fig. 1). However, the observed surface melt at Summit cannot be 
explained by warm air advection alone, because the surface temperature 
over the GIS is controlled by a balance of radiative and turbulent heat 
fluxes'*"'® that are sensitive to cloud and atmospheric properties. 

The local energy balance at the surface can be understood by 
accounting for three forcings on surface temperature, namely, the 
net radiative flux divergence at the surface, heat exchange with the 
atmosphere, and heat exchange with the underlying ice. A simple 
parametric model of these processes can be written as: 


OT: 1 Ta — Ts T,—Ts 
= Sa0F AFyer 4 : : t 2 (1) 
ot CppH Ts Tj 

|_| -—’ 


Radiative forcing Atmospheric heat exchange Ice heat exchange 


This model is used here to study the effect of clouds on the temporal 
development of surface temperature. In equation (1), Ts is the surface 
temperature, t is time, H is the height of the inversion layer, « is a shape 
parameter characterizing the form of the temperature profile in the 
boundary layer, p is the average air density of the boundary layer, cy is 
the specific heat of air, T, is the temperature at the top of the inversion, 
T; is the ice temperature at a depth low enough to be considered 
constant over the model integration time, AFygr is the net radiative 
flux divergence at the surface, and t, and 7; are relaxation timescales 
for heat exchange processes in the atmosphere and ice, respectively. 
Whereas the two heat exchange terms are non-negligible, the radiative 
forcing plays a dominant role in the development of surface temper- 
ature’, which justifies the basic parameterizations of heat fluxes using 
relaxation times. In reality, atmospheric heat fluxes are complex func- 
tions of wind speed, turbulence state, temperature, and moisture pro- 
files. The heat transfer into the surface depends on the snow’s thermal 
conductivity but also on radiative heat exchange between different 
snow layers and solar radiation penetrating deeper into the snow 
layer'’. The dependency of t, on atmospheric stability is simply model- 
led by making 1, increase with increasing atmospheric stability, that is, 
Ta =f(T, — Ts). The model’s response is not very sensitive to the 
choices of some of the model parameters; these parameters were fixed 
at H = 210m, a = 2,1; = 15h, and T; = —40 °C. The particular choices 
of these parameters, as well as details of the model, are justified and 
outlined in the Supplementary Information. 

The model assumes the temperature of the free atmosphere at the 
top of the inversion layer (T,,) to be a lateral boundary condition 
determined by advection. This temperature was determined from 
12-hourly radiosonde observations at Summit. The individual radi- 
ative fluxes constituting AF\pr, the net radiative flux at the surface, are: 


AFyer = Fey — Few + Few —Flw (2) 


The up and down arrows indicate the upwelling and downwelling 
radiation, respectively. The surface temperature model can be forced 
with observed or simulated radiative fluxes. Both approaches were 
performed for this study. For simulating fluxes, a simple atmospheric 
radiative transfer model was devised, which is outlined in the 
Supplementary Information. The shortwave component of the model 


(Fay) includes the radiative effects of water vapour, ozone, carbon 


dioxide, cloud liquid water, and surface albedo. The downwelling long- 


1 


wave component (Fw) accounts for thermal emission by atmo- 


spheric gases and cloud liquid water. The upwelling longwave 
radiation was simulated simply using Stefan—Boltzmann’s law and a 
surface emissivity of unity, that is, Fly =osp Tg, where os is the 
Stefan-Boltzmann constant. The radiative transfer model does not 
account for ice clouds. 

Figure la provides an overview of the temporal development of Ty, 
the temperature at a height of two metres, both observed and simulated 
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Figure 1 | Observed and simulated temporal evolution of the July 2012 
surface melting event at Summit. a, Temporal evolution of the temperature 
2m above the surface (T>,,) for July 2012; b, expanded view of data from a for 
the extended GIS melting period (9-15 July 2012). In the top parts of a and 
b, the black curve shows T>,, observed by the NOAA meteorological (met.) 
tower, the red curve shows the temperature observed by radiosonde at 500 m 
above ground (T's9om), the green curve shows the development of T},, simulated 
using the surface energy balance model driven by observed radiative (obs. rad.) 
fluxes”®, and the yellow curve shows the development of T>, simulated using 
the surface energy balance model driven by simulated radiative (sim. rad.) 
fluxes based on water vapour path (WVP) and liquid water path (LWP). Values 


for July 2012 at Summit. For weaker surface inversions, this temper- 
ature will be very close to the surface temperature, T>,, ~ Ts. In the 
case of strong surface inversions, T>,, can be higher than Ts. Such cases 
might occur, for example, during the night under cloud-free condi- 
tions (for example, 14 July 2012). The cloud-free diurnal cycle of T3,,, is 
of the order of 10°C, with low temperatures during night caused by 
less incoming solar radiation and strong longwave energy loss. (We 
note that the solar zenith angle at the beginning of July at Summit is 
about 85° at local midnight and around 52° at local noon.) The surface 
melting period on 11 July 2012 was marked by low cloud bases, which 
occur very frequently over Summit. Low, liquid-bearing clouds were 
also present on 12 July 2012, followed by three cloud-free days. During 
the cloud-free period, and in particular on 13 July, warm air was still 
present over Summit but surface temperatures did not rise above 0 °C. 
A visual overview of cloudiness for July 2012 (as well as July 2010 
and 2011 for comparison) is provided in Supplementary Fig. 5 and 
Supplementary Table 2, based on radar and lidar observations. 
Despite its simplicity, the prognostic surface temperature model 
described in equation (1) reasonably captures the long-term variabi- 
lity, the phase and amplitude of the diurnal cycle, as well as its modu- 
lation by liquid clouds. Deviations occur in the presence of high ice 
clouds, which were not included in the simulations. Note that model- 
ling results based on surface radiative flux observations agree well with 
the results where the surface radiative fluxes were computed using 
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of these two last quantities are shown in the lower parts of a and b with separate 
axes: blue (red) dots show LWP (WVP) values observed by an upward looking 
microwave radiometer. c, Model sensitivity studies with respect to clouds and 
solar radiation. Yellow line, as in a and b; turquoise curve, T>,, without solar 
radiation; blue curve, T>, assuming a cloud-free atmosphere (LWP =0g m ”); 
purple curve, T>,, assuming an atmosphere with a very thick cloud with 
constant LWP of 500 g m ~. Inset, simulated maximum T>,, (corresponding to 
the position of the arrow ‘surface melting’ in b) as a function of an assumed 
constant LWP; red data point shows the observed T>,, and LWP at a time of 
+30 min around the time of maximum surface temperature (error bars, 

+1 s.d.). 


cloud observations and the radiative transfer model. This agreement 
confirms that ice clouds play a minor role in this particular study 
because the simulated radiative fluxes do not include effects of ice 
clouds whereas the observed fluxes do. The model also predicts the 
timing and strength of the surface melt event in July 2012 with rea- 
sonable accuracy (see Fig. 1b). 

The modulating role of low-level liquid-containing clouds on the 
surface energy balance can be understood by considering two compet- 
ing effects. First, these clouds reflect solar radiation to space, reducing 
shortwave energy available for surface warming and dampening the 
diurnal cycle. Second, low-level clouds radiate energy downwards in 
the infrared. This process is efficient even for very thin clouds. At 
liquid water path (LWP) values greater than 20gm_ ”, clouds become 
nearly completely opaque in the infrared’*”” in which case the down- 
welling longwave radiation is determined almost entirely by cloud 
temperature. For the dry atmosphere observed over the GIS, cloudy- 
sky downwelling longwave fluxes can easily be 100 W m * higher than 
those measured for cloud-free conditions. 

The relative balance of decreased solar radiation and increased 
downward longwave radiation depends on cloud optical properties, 
which are most strongly modulated by variations in LWP. Figure 1c 
shows model sensitivity studies for two extreme cases: no cloud (that is, 
LWP = 0gm ”) and an extremely thick cloud with LWP constant at 
500gm ~. In the cloud-free case, a strong diurnal cycle exists but 
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Figure 2 | Observed spatial distribution of clouds over Greenland on 11 July 
2012 and their effect on surface temperature. a, Near-true-colour satellite 
image of Greenland observed by MODIS (on the Aqua satellite) on 11 July 
2012, 14:55-15:05 utc (MODIS channels 1, 4, 3). b, False-colour image for the 
same time period highlighting liquid-bearing clouds in purple (MODIS 
channels 7, 2, 1). The star shows the position of Summit. White contour lines 
give terrain height in 500-m intervals starting at 1,000 m. c, The maximum 
simulated temperature under cloud-free conditions (see text for details). The 
blue line corresponds to the 0 °C isothermal—that is, the height below which 
melting would occur under cloud-free conditions. This line is also shown in 
a and b as a reference. 


near-surface temperatures do not rise above —4°C. For the thick 
cloud, near-surface temperature follows the cloud temperature but 
does not exceed a value of —3 °C. 

In neither of these two extreme cases does the near-surface temper- 
ature rise to values above the melting point. Only for a limited range of 
intermediate LWP values do the combined longwave and shortwave 
radiative effects conflate to push temperatures above 0 °C. Indeed, the 
energy balance model indicates that surface melting only occurs in this 
case for LWP values between 10 and 40gm ~* (see Fig. Ic inset). 
Within this range, the cloud is opaque enough to ‘trap’ longwave 
radiation but still sufficiently thin to allow enough solar radiation to 
penetrate through to the surface. Clouds of this nature probably affec- 
ted broad spatial areas of the GIS during this event. Figure 2a and b 
provide an overview of the cloud situation over the GIS on 11 July 
2012, observed by the Moderate Resolution Imaging Spectrometer 
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Figure 3 | Frequency of occurrence of thin, liquid-bearing clouds. For the 
purpose of this plot, ‘thin, liquid-bearing’ clouds are defined as clouds in the 
range of l0gm *<LWP <60gm ”, corresponding to the range of 
maximum enhanced cloud radiative forcing at the surface. a-d, Comparisons 
of ground-based observed (blue, microwave radiometer (MWR)) and ERA- 
Interim simulated (red, ERA) frequencies of occurrence of these clouds for four 
Arctic observation sites for all seasons; a, Barrow, Alaska; b, Surface Heat 
Budget of the Arctic Ocean (SHEBA) experiment (a research vessel frozen in 
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(MODIS) on NASA’s Aqua satellite, and show extended liquid clouds 
over large parts of the central GIS. 

Surface melting would not have occurred over Summit in July 2012 
without the optically thin, liquid-water-containing clouds. However, 
temperatures were warm enough during this time to trigger melting 
without clouds at lower-elevation regions of the GIS. The diurnal cycle 
and the total amount of incoming solar radiation also vary with lati- 
tude. A second sensitivity study was carried out to estimate the relative 
impact of these two factors—that is, terrain height above sea level and 
latitude—on surface melting. Figure 2c shows the result of this sensi- 
tivity study, assuming cloud-free conditions and adjusting the model 
(equation (1)) for height effects on atmospheric temperature, down- 
welling longwave radiation, and incoming solar radiation as function 
of latitude. For all temperature dependencies on surface height, a 
climatological mean temperature lapse rate of 7.1K km’ ' was used”°. 
Terrain slope and orientation were not considered, as terrain slopes on 
top of the GIS are generally small. This sensitivity study is not expected 
to reproduce the exact conditions observed over Greenland, but it 
does provide insight into the interplay of terrain height, available solar 
radiation, and clouds. The maximum simulated temperature (Fig. 2c) 
did not exceed 0°C at terrain heights above roughly 2,700 m for 
regions south of Summit. At more northerly locations, where less solar 
radiation is available, these terrain heights fell below 2,000 m (see the 
blue 0 °C isothermal in all three panels of Fig. 2). In comparison, for a 
homogeneous cloud of 30 gm”, the model predicted melting every- 
where (not shown). For this example model exercise, the spatial extent 
of GIS melt conditions increased from 61% to 100% as a result of the 
optically thin clouds. 

This study highlights the intricate role of clouds in modulating the 
energy balance at the surface above the GIS in boreal summer. Thin, 
boundary-layer, liquid-containing clouds can change the surface 
energy balance towards warmer or colder surface temperatures 
depending on cloud optical thickness. In the case examined here, these 
clouds played a critical role in enhanced surface melt at Summit, 
Greenland, and probably contributed strongly to the 97% spatial melt 
extent observed over the GIS in early July 2012’. 

Thin, liquid-bearing clouds are ubiquitous across the Arctic’. 
Ground-based observations and model reanalysis products from the 
European Center for Medium Range Weather Forecasting (ERA- 
Interim) show occurrence frequencies ranging from 20% to more than 
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the pack ice to study the ocean and atmosphere); c, Eureka, Nunavut (research 
base on Ellesmere Island, Canada); and d, Summit, Greenland. DJF, 
December-February; MAM, March-May; JJA, June-August; SON, 
September—November. e, Circumpolar map of the frequency of occurrence of 
these clouds from 32 yr of ERA reanalysis (1979-2011). The plot in e is 
conditionally sampled to only include cases with solar zenith angle lower than 
80° and a surface albedo higher than 0.5; locations of the observation sites 
(a-d) are indicated. 
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50% of the time for four observational sites in summer and for most 
observational sites also in spring and autumn (Fig. 3a—d). (See Sup- 
plementary Information section 6 for details on the temporal coverage 
of the different observation sites; please also note that ERA-Interim 
histograms shown in Fig. 3a-d are restricted to the time period when 
observations are available.) Based on ERA-Interim reanalysis, Fig. 3e 
shows the occurrence of thin, liquid-bearing clouds in situations where 
the surface albedo is larger than 0.5 and at least some solar radiation is 
available (solar zenith angle less than 80°). These situations are most 
conducive to enhanced surface warming by the combined effects of 
solar and infrared warming discussed above***. Thin liquid clouds 
occur most frequently over sea-ice-covered areas north of 80°N, as 
well as over the Beaufort Sea and the east Siberian Sea, highlighting the 
importance of such clouds not only for the GIS but also for the surface 
energy balance over sea ice. 

To simulate the surface energy balance correctly, it is imperative for 
climate models to accurately represent these liquid-containing clouds 
and their occurrence fraction, radiative properties and responses to a 
changing Arctic climate system. The comparisons provided in Fig. 3a—d 
highlight a significant shortcoming in our current modelling cap- 
abilities of Arctic clouds. ERA reanalysis provides mostly reasonable 
results for thin, liquid-cloud occurrence in summer months. However, 
the model significantly underestimates the occurrence of such clouds 
in winter and spring for most sites and for Summit in all seasons. A 
similar underestimation of the occurrence of thin, liquid clouds was 
recently reported® for global climate models based on comparisons 
with satellite-derived estimates of cloudiness. The discrepancies were 
attributed largely to the simplified treatment of cloud phase in models, 
which typically does not allow for liquid clouds to form at tempera- 
tures lower than about —20°C. A similar phase determination scheme 
is employed in the ERA-Interim data, potentially explaining the sig- 
nificant low bias in modelled liquid-cloud occurrence over Summit 
(Fig. 3d). These shortcomings bias the Arctic surface energy balance 
predicted by present-day weather and climate models’, and limit the 
ability of these models to predict the cloud response to Arctic climate 
change and possible feedbacks. 


METHODS SUMMARY 


Ground-based infrared, microwave, radar and lidar remote sensing observations, 
as well as radiosonde data, were collected as part of the project ‘Integrated char- 
acterization of energy, clouds, atmospheric state, and precipitation at Summit’ 
(ICECAPS)’. Cloud liquid-water path estimates were derived from multi-channel 
microwave measurements’. Downwelling longwave fluxes were derived from 
high-resolution infrared observations and cloud-free fluxes were calculated on 
the basis of radiosonde observations”. In addition, shortwave and longwave 
upwelling and downwelling radiative fluxes were observed from a suite of broad- 
band radiometers”*”’, while temperatures at a height of two metres were observed 
from a meteorological tower operated by NOAA. All Summit, SHEBA, and Eureka 
ground-based data are available from the NOAA ESRL site (ftp://ftp1.esrl.noaa. 
gov/psd3/arctic/). Barrow data are available from the DOE ARM archive (http:// 
www.archive.arm.gov/). MODIS satellite observations were obtained from NASA’s 
Level 1 and Atmosphere Archive and Distribution System. ECMWF ERA Reana- 
lysis data are available from the ECMWF data server (http://www.ecmwf.int). 
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Bread wheat (Triticum aestivum, AABBDD) is one of the most 
widely cultivated and consumed food crops in the world. However, 
the complex polyploid nature of its genome makes genetic and func- 
tional analyses extremely challenging. The A genome, as a basic gen- 
ome of bread wheat and other polyploid wheats, for example, 
T. turgidum (AABB), T. timopheevii (AAGG) and T. zhukovskyi 
(AAGGA™A”), is central to wheat evolution, domestication and 
genetic improvement’. The progenitor species of the A genome is 
the diploid wild einkorn wheat T. urartu’, which resembles culti- 
vated wheat more extensively than do Aegilops speltoides (the 
ancestor of the B genome’) and Ae. tauschii (the donor of the D 
genome’), especially in the morphology and development of spike 
and seed. Here we present the generation, assembly and analysis of a 
whole-genome shotgun draft sequence of the T. urartu genome. We 
identified protein-coding gene models, performed genome structure 
analyses and assessed its utility for analysing agronomically impor- 
tant genes and for developing molecular markers. Our T. urartu 
genome assembly provides a diploid reference for analysis of poly- 
ploid wheat genomes and is a valuable resource for the genetic 
improvement of wheat. 

Bread wheat is one of the most important food crops worldwide, and 
provides about 20% of the calories consumed by humans’. To acce- 
lerate wheat improvement, a substantial amount of research has been 
conducted on the genome. The International Wheat Genome Sequen- 
cing Consortium aims at flow-sorting and sequencing the individual 
chromosomes of bread wheat, and significant progress has been made 
with several chromosomes, for example 3B (ref. 6) and 4A (ref. 7). 
More recently, a whole-genome shotgun sequence analysis of bread 
wheat and its diploid relatives® has allocated more than 60% of the 
genes to the A, B and D genomes with more than 70% confidence. The 
sequence of diploid progenitor genomes will allow the complete and 
unambiguous assignment of their homeologous relationships. 

We sequenced T: urartu accession G1812 (P1428198) using a whole- 
genome shotgun strategy on the Illumina HiSequation (2000) platform, 
and assembled the genome using SOAPdenovo (v. 1.05)’ with 448.49 
gigabases (Gb) of filtered high-quality sequence data (Supplementary 
Information). We estimated the genome size of T. urartu to be 4.94 Gb 
(Supplementary Information), which is consistent with previous reports 
of 4.8-5.7 Gb (refs 10, 11). The genome assembly reached 3.92 Gb with a 
contig N50 size (at which 50% of assembly was covered) of 3.42 kilobases 
(kb). After gap closure, the draft assembly was 4.66 Gb with a scaffold 
N50 length of 63.69 kb (Table 1 and Supplementary Information). The 
length of the contigs that contained intact or partial genes ranged from 


200 base pairs (bp) to 65.8 kb, with an average length of 9.91 kb. The 
assembly was evaluated by comparisons with published bacterial arti- 
ficial chromosome and expressed sequence tag (EST) sequences and by 
validation with PCR (Supplementary Information), and both indicated 
that the draft sequence had extensive genome coverage with high accu- 
racy. The distribution of GC content in the T. urartu genome was com- 
parable with those in the genomes of rice’*, maize’’, sorghum" and 
Brachypodium distachyon’* (Supplementary Information). 

Genome annotation of the assembly was performed as described in 
Supplementary Information. About 66.88% of the T. urartu assembly 
was identified as repetitive elements, including long terminal repeat 
retrotransposons (49.07%), DNA transposons (9.77%) and unclassified 
elements (8.04%) (Supplementary Information). The proportion of 
repetitive DNA was lower than the roughly 80% previously reported’’, 
which is probably due to a decreased incorporation of repeat sequence 
reads into the assemblies. 

To facilitate gene prediction, we generated a 116.65-megabase (Mb) 
transcriptome of T. urartu with 67.14Gb of RNA-Seq data from 
eight different tissues and treatments using the HiSequation (2000) 
platform, and 49,935 assembled transcripts from six tissues using the 
Roche 454 sequencing platform (Supplementary Information). These 
data, together with publicly available ESTs from hexaploid wheat, and 
homologues from sequenced grass genomes'*'*””, were used as evid- 
ence in gene prediction (Supplementary Information). In total, we 
predicted 34,879 protein-coding gene models. The average gene size 
was 3,207 bp, with a mean of 4.7 exons per gene, which was similar to 
that found for B. distachyon (5.2)'° but slightly higher than that of 
rice (3.8)'*, maize (4.1)? and sorghum (4.3)'*. In comparison with 
the 28,000 genes estimated for the A genome of hexaploid wheat’, 
our gene set for T. urartu contained 6,800 more members, indicating 
a more complete representation of genes in our analysis. However, 
the different approaches used in this work and in a previous study’, 
and the extensive loss of genes in the hexaploid A genome com- 
pared with its diploid progenitor®, may also have contributed to this 
difference. 

We also obtained 14,222,170 small RNA (sRNA) reads (18-30 bp) 
representing 4,369,970 unique sRNA tags. In total, 412 conserved and 
24 new microRNAs (miRNAs) distributed into 116 families were iden- 
tified. Comparison with the miRNAs of five monocots and five dicots 
showed that 73 miRNA families were specific to monocots, of which 23 
were uniquely present in T. urartu. We predicted 244 target genes for 
these miRNAs and found that the target gene (TRIUR3_06170) of 
miRNA MIR5050 responded to cold treatment, which provides a 
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Table 1 | Sequencing and draft genome assembly statistics 


Assembly step DNA library insert size (bp) Effective data (Mb) N50 (bp) Total length (Mb) Nst+ (Mb) Number >2 kb 
Contig construction 167-796 357,810 3,422 3,920 - 385,430 
Scaffold construction 167-20,000 90,682 64,532 - 1,258 - 

Gap closure* 167-796 357,810 63,687 4,660 738 81,689 


* Final statistics of scaffolds after gap closure. 
+The added N number during scaffolding. 


new resource for investigating the regulation of cold adaptation 
through miRNA (Supplementary Information). 

The gene families of T. urartu were compared with those of rice’’, 
maize’, sorghum" and B. distachyon’* using OrthoMCL"™ (Sup- 
plementary Information). We identified 24,339 families in the five 
grasses. Of these, 9,836 families, which contained 68,464 genes, were 
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Figure 1 | Comparison of gene families and coding domains of T. urartu 
with rice, maize, sorghum and B. distachyon. a, Venn diagram illustrating 
shared and unique gene families (gene numbers in parenthesis) among the five 
grass species. b, Comparison of protein families with different Pfam domains 
among the five species. Fourteen Pfam domains that had significant differences 
(P< 0.001, Fisher’s exact test) in their percentages between T. urartu and the 
other four grass species are shown. The percentage was calculated by dividing 
the number of each domain by the total gene set in a given species. 
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common to all five species. Another 1,103 families, containing 
3,425 genes, were specific to T. urartu (Fig. 1a). GO analysis of the 
T. urartu-specific families revealed that 556, 230 and 841 genes were 
involved in biological processes, cellular compounds and molecular 
functions, respectively. In total, 2,067 Pfam domains were shared 
among the five species. Of these, 14 Pfam domains had differences 
in member numbers in T. urartu compared with the other four grasses 
(Fig. 1b). These included NB-ARC and serine-threonine/tyrosine- 
protein kinase domains that were markedly increased in T. urartu, 
and C3HC4 RING-type and pathogenesis-related transcriptional 
factor/ERF domains that were significantly decreased. However, deter- 
mination of the significance and accuracy of these differences will 
require more detailed analysis. 

Given that NB-ARC domain proteins function mainly in disease 
resistance’’, we analysed the genes encoding R proteins in the T. urartu 
genome and identified 593 such genes, which were more abundant 
than in B. distachyon (197), rice (460), maize (106) and sorghum (211) 
(Supplementary Information). In contrast with barley genome data”’, 
the ratio of NBS-LRR type genes in T. urartu (1.21%) was also sub- 
stantially higher than that in barley (0.73%). These analyses indicate 
that there was a specific expansion of R genes in the T. urartu genome. 

The scaffolds and gene models of T. urartu were assigned to chromo- 
somes by using genetically mapped bread wheat ESTs”° as queries to 
search for homologous sequences in the T. urartu assembly (Sup- 
plementary Information). A total of 8,715 scaffolds, harbouring 14,578 
genes (41.8% of the total predicted genes) were mapped to 45 chromo- 
somal regions of the wheat A genome. Syntenic alignments between the 
T. urartu and B. distachyon genomes were constructed by using a set of 
14,578 orthologous genes (Fig. 2a). These gene-based alignments con- 
form, and supply more details, to the broad framework of genome 
synteny between wheat and B. distachyon proposed previously’’. 

The 4.94-Gb T. urartu genome is more than 18 times larger than the 
272-Mb genome of B. distachyon. Given that the average gene size of 
T. urartu is similar to that of B. distachyon, and the predicted gene 
number (34,879) is only about 1.37-fold that of B. distachyon 
(25,532), the larger genome size of T. urartu might be due to increased 
intergenic spaces. We therefore compared the intergenic space of the 
syntenic blocks between T. urartu and B. distachyon (Supplementary 
Information). About 21% of T. urartu genes had similarly sized inter- 
genic spaces to those in B. distachyon, but most T. urartu genes were 
separated by a greatly increased intergenic space enriched in Gypsy and 
Copia retrotransposons, and were present in separate scaffolds 
(Fig. 2b). This provides the genome sequence-scale evidence for the 
role of repeat expansion in genome size enlargement during the evolu- 
tion of the tribe Triticeae. 

We next demonstrated the utility of the T. urartu draft genome 
sequence for finding agronomically important genes through identify- 
ing the T. urartu homologue of OsGASR7, a gibberellin-regulated gene 
that controls grain length in rice’’. We found two haplotypes (H1 and 
H2) for TuGASR7 in 92 diverse T. urartu accessions collected from 
different regions. H1 was significantly associated with greater values of 
grain length and grain weight (Fig. 3 and Supplementary Information). 
We also found natural variation of TaGASR7 in bread wheat, with the 
elite variant associated with improved yield-related traits, suggesting 
that TaGASR7 is of use for the improvement of wheat yield (Sup- 
plementary Information). 

The T. urartu assembly also served as a rich resource for the develop- 
ment of genetic markers for molecular breeding through genomic 
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selection. We identified 739,534 insertion-site-based polymorphism 
(ISBP) markers and 166,309 simple sequence repeats (SSRs) (Sup- 
plementary Information). PCR validation showed that 94.5% of the 
SSRs and 87% of the ISBP markers gave the expected products, and that 
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Figure 2 | Synteny analysis between 
T. urartu and B. distachyon, and 
genome expansion in T. urartu. 

a, Syntenic relationships between the 
seven hypothetical chromosomes of 
T. urartu (1A-7A) and the five 
chromosomes of B. distachyon (Bd1- 
Bd5). The deletion bin maps of bread 
wheat 1A-7A chromosomes are 
noted at the bottom. b, Comparison 
of intergenic spaces in one 
representative syntenic block from 
T. urartu and B. distachyon. The 50- 
kb block from B. distachyon 
chromosome 1 (Bd1) contains five 
genes (indicated by red arrowheads). 
The T. urartu orthologues (yellow 
arrowheads) of the five genes are 
distributed in five separate scaffolds 
(totalling 1,092 kb). The total 
intergenic space in the displayed 

T. urartu region is more than 20-fold 
that in the corresponding Bd1 region. 


scaffold46726 191,374 bp scaffold28828 118,039 bp 


33.61% of the SSRs and 10.19% of the ISBP markers were specific to the 
A genome. Moreover, 28.7% of the SSR loci were polymorphic in bread 
wheat (Supplementary Information). To enable the identification of 
single nucleotide polymorphisms (SNPs), we re-sequenced another 
T. urartu accession (DV2138) and obtained 78.6 Gb of high-quality 
data. Comparison of the genome data between the two T. urartu acces- 
sions (G1812 versus DV2138) allowed the discovery of 2,989,540 
SNPs, which will be useful for the future development of SNP markers 
(Supplementary Information). 

Previous studies have revealed that more than half of the 60 meta- 
quantitative trait loci (meta-QTLs)”” related to wheat yield traits are 
present in the Agenome of bread wheat, and three meta-QTLs 
(MQTL_5, 6 and 7) are located on chromosome 5A (ref. 22). We 
therefore searched the T. urartu scaffolds using available markers 
around the three meta-QTLs (Supplementary Information). We found 
ten scaffolds with a total length of 772,014 bp that were distributed in 
the 14-centimorgan (cM) region of MQTL_5; nine scaffolds with a 
combined length of 783,140 bp were located in the 15-cM region con- 
taining MQTL_6; and six scaffolds with an overall length of 529,604 bp 
were assigned to the 9-cM region harbouring MQTL_7. The sequence 
information of these scaffolds will expedite the development of more 
polymorphic markers within the three meta-QTL regions and facilitate 
the identification of their corresponding genes. 


Figure 3 | Association analysis of the TuGASR7 gene. a, Indel differences 
between the TuGASR7 haplotypes H1 and H2 compared with their 
corresponding regions in the GASR7 homologues from barley (Hordeum 
vulgare; HvGASR7) and T. aestivum (TaGASR7). b, Phylogenetic relationships 
among the homologous genes of rice GASR7 from different grass species. 
TRIUR3_28594 is the predicted homologue of GASR7 in T. urartu. c, Typical 
grain lengths for the two detected TuGASR7 haplotypes, H1 (top panel) and H2 
(bottom panel). For each haplotype, the seeds from four representative 

T. urartu accessions were photographed. To illustrate the difference in grain 
length more clearly, ten seeds were randomly selected from each of the 
compared accessions and aligned closely for the photograph. 
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Our T. urartu draft genome sequence provides new insights into the 
A genome that is shared by many polyploid wheat species. The large set 
of gene models (34,879) and abundant genetic markers anchored in 
sequence scaffolds, together with the emerging genomic resources 
from bread wheat’, promise to accelerate deeper and more systematic 
genomic and breeding studies of bread wheat that are required to meet 
the future challenges of food security and sustainable agriculture. 


METHODS SUMMARY 


The genome of T.urartu accession G1812 was sequenced on the Illumina 
HiSequation (2000) platform. These data were used to assemble the draft genome 
sequence with the use of the SOAPdenovo’ software. RNA-Seq data were gener- 
ated on the same platform and Roche 454 for genome annotation and transcrip- 
tome analysis. Repeat sequences were identified through sequence similarity at the 
nucleotide and protein levels”*. Protein-coding genes were predicted by using an 
ab initio approach, sequence similarity search and RNA-Seq data to build reliable 
gene models”. Detailed methodology descriptions are given in Supplementary 
Information. 
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Aegilops tauschii draft genome sequence reveals a 
gene repertoire for wheat adaptation 
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About 8,000 years ago in the Fertile Crescent, a spontaneous hybridi- 
zation of the wild diploid grass Aegilops tauschii (2n = 14; DD) with 
the cultivated tetraploid wheat Triticum turgidum (2n = 4x = 28; 
AABB) resulted in hexaploid wheat (T. aestivum; 2n = 6x= 42; 
AABBDD)’”. Wheat has since become a primary staple crop world- 
wide as a result of its enhanced adaptability to a wide range of 
climates and improved grain quality for the production of baker’s 
flour’. Here we describe sequencing the Ae. tauschii genome and 
obtaining a roughly 90-fold depth of short reads from libraries with 
various insert sizes, to gain a better understanding of this genetic- 
ally complex plant. The assembled scaffolds represented 83.4% of 
the genome, of which 65.9% comprised transposable elements. We 
generated comprehensive RNA-Seq data and used it to identify 
43,150 protein-coding genes, of which 30,697 (71.1%) were uniquely 
anchored to chromosomes with an integrated high-density genetic 
map. Whole-genome analysis revealed gene family expansion in 
Ae. tauschii of agronomically relevant gene families that were asso- 
ciated with disease resistance, abiotic stress tolerance and grain 
quality. This draft genome sequence provides insight into the envir- 
onmental adaptation of bread wheat and can aid in defining the 
large and complicated genomes of wheat species. 

We selected Ae. tauschii accession AL8/78 for genome sequencing 
because it has been extensively characterized genetically (Supplemen- 
tary Information). Using a whole genome shotgun strategy, we generated 
398 Gb of high-quality reads from 45 libraries with insert sizes ranging 
from 200bp to 20kb (Supplementary Information). A hierarchical, 
iterative assembly of short reads employing the parallelized sequence 
assembler SOAPdenovo* achieved contigs with an N50 length (mini- 
mum length of contigs representing 50% of the assembly) of 4,512 bp 
(Table 1). Paired-end information combined with an additional 18.4 Gb 
of Roche/454 long-read sequences was used sequentially to generate 
4.23-Gb scaffolds (83.4% were non-gapped contiguous sequences) with 
an N50 length of 57.6kb (Supplementary Information). The assembly 
represented 97% of the 4.36-Gb genome as estimated by K-mer analysis 
(Supplementary Information). We also obtained 13,185 Ae. tauschii 
expressed sequence tag (EST) sequences using Sanger sequencing, of 
which 11,998 (91%) could be mapped to the scaffolds with more than 
90% coverage (Supplementary Information). 

To aid in gene identification, we performed RNA-Seq (53.2 Gb fora 
117-Mb transcriptome assembly) on 23 libraries representing eight 
tissues including pistil, root, seed, spike, stamen, stem, leaf and sheath 


2,10,11 


(Supplementary Information). Using both evidence-based and de novo 
gene predictions, we identified 34,498 high-confidence protein-coding 
loci. FGENESH* and GeneID models were supported by a 60% 
overlap with either our ESTs and RNA-Seq reads, or with homologous 
proteins. More than 76% of the gene models had a significant match 
(Evalue < 10°; alignment length = 60%) in the GenBank non- 
redundant database. An additional 8,652 loci were predicted as low- 
confidence genes as a result of incomplete gene structure or limited 
expression data support (Supplementary Information). We also predicted 
a total of 2,505 transfer RNA, 358 ribosomal RNA, 35 small nuclear RNA 
and 78 small nucleolar RNA genes (Supplementary Information). 

We found that more than 65.9% of the Ae. tauschii genome was 
composed of different transposable element (TE) families (Supplemen- 
tary Information). About 5 X 10° Illumina reads of Ae. tauschii were 
mapped to hexaploid wheat repetitive sequences and we found that a 
comparable percentage of reads (more than 62.3%) could be classified 
as part of a TE sequence (Supplementary Fig. 6). This estimate is 
similar to that derived froma previous survey of Roche/454 sequences’. 
There were 410 different TE families, of which the 20 most abundant 
contributed more than 50% of the Ae. tauschii gnome (Supplemen- 
tary Table 9). A single peak of increased insertion activity was estimated 
to occur about 3-4 Myr ago by measuring the similarity of the assembled 
LTR retrotransposons (Supplementary Information), suggesting that 
the expansion of the Ae. tauschii genome was relatively recent and coin- 
cided with the abrupt climate change during the Pliocene Epoch’. 

We constructed a high-density genetic map using an F, population 
of 490 individuals derived from a cross between the Ae. tauschii 
accessions Y2280 and AL8/78. The map, whose total length was 
1059.8 centimorgans (cM), consisted of 151,083 single nucleotide poly- 
morphism (SNP) markers developed by restriction-site-associated 
DNA (RAD) tag sequencing technology (Supplementary Fig. 13). 
Together with bin-mapped wheat ESTs’, SNPs and tags'®, the genetic 
map was used to align 30,303 scaffolds (1.72 Gb; 30,697 genes) to 
chromosomes (Supplementary Information). The Ae. tauschii genes 
and scaffolds were also anchored to barley’ and Brachypodium chro- 
mosome maps"? (Fig. 1 and Supplementary Fig. 17). Calculation of 
K,/K; ratios (the ratio of non-synonymous substitutions to synony- 
mous substitutions) for pairs of conserved orthologous genes showed 
that the average values between Ae. tauschii and barley (20,892 genes), 
Brachypodium (17,231 genes), rice (16,370 genes) and sorghum 
(18,623 genes) were 0.2214, 0.1888, 0.1736 and 0.1726, respectively, 
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Table 1 | Overall statistics of sequencing and genome assembly 


Assembly process Library insert size (bp) Read length (bp) Effective data (Gb) N50 (bp) N50 number Total length (Mb) Gaps (Mb) 
Contig assembly 167-764 44,75, 100, 150 270 4,521 179,145 3,528 = 
Scaffolding 2,000-20,000 44, 49,90 128 58,011 19,405 4,244 1,122 
Gap closure 167-764 44,75, 100, 150 270 57,585 19,455 4,229 701 
114-263 65 
~600* 18 


* Reads from 454 sequencing platform. 


which indicated that most gene lineages evolved under purifying selec- 
tion in Ae. tauschii. A total of 628 genes exhibited K,/K, ratios of more 
than 0.8 when compared with the other four species, indicating poten- 
tial positive selection (innermost circle of Fig. 1). These genes were 
assigned to a wide range of molecular functions by using Gene 
Ontology (GO) analyses (Supplementary Table 14). 

Ae. tauschii proteins were clustered with those of Brachypodium, 
rice, sorghum and barley (full-length complementary DNAs), and 
formed 23,202 orthologous groups (at least two members; Supplemen- 
tary Information). In total, we identified 11,289 (barley/Ae. tauschii) 
and 14,675 (Brachypodium/Ae. tauschii) orthologous gene pairs. We 
found that 8,443 gene groups contained sequences from all five grass 
genomes, and 234 were specific to Pooideae (Ae. tauschii, Brachypo- 
dium and barley) and 587 were specific to Triticeae (Ae. tauschii and 
barley) (Fig. 2a). Enrichment analyses of both Pfam domains and GO 
terms showed that genes encoding NBS-LRR (nucleotide-binding-site 


Q ~~ 
Sv 
ont && gioo 
Emg2 22 
ems 69 
o m5 QF 
£M@6 o> 
Gm ES 
So 
3) 
rt? 


Figure 1 | Comparative analysis of Ae. tauschii ordered scaffolds versus 
barley and Brachypodium. The inner circle represents the seven Ae. tauschii 
chromosomes scaled according to the genetic map incorporating genome 
scaffolds. Red points show the K,/K;, ratios between anchored Ae. tauschii genes 
and their putative orthologues in Brachypodium. Moving outwards, the second 
circle compares Ae. tauschii against the seven barley chromosomes’. The 
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leucine-rich repeat) proteins were over-represented in Ae. tauschii 
relative to Brachypodium and rice’ (Supplementary Information). 
These observations are consistent with those reported in a recent 
study’*. A total of 1,219 Ae. tauschii genes were similar to NBS-LRR 
genes (R gene analogues (RGAs))''* (Supplementary Information). 
This number is double that in rice (623) and sixfold that in maize 
(216), indicating that the RGA family has substantially expanded 
in Ae. tauschii. We mapped 878 RGAs (72%) to specific positions across 
wheat chromosomes by using molecular marker-genome sequence 
alignment, which provides a large number of potential disease resist- 
ance loci for further investigation. 

We found more genes for the cytochromeP450 family in 
Ae. tauschii (485) than in sorghum (365), rice (333), Brachypodium 
(262) or maize (261). This family of genes is important for abiotic 
stress response, especially in biosynthetic and detoxification pathways". 
Using 178 manually curated cold-acclimation-related genes such 


heatmaps show the density distribution of barley cDNA loci that are aligned 
with Ae. tauschii genes. The outer two circles illustrate Brachypodium 
chromosomes according to conserved synteny with Ae. tauschii. The coloured 
lines below each chromosome identify putative orthologous gene pairs between 
Ae. tauschii genes, barley genes and Brachypodium genes. 
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Figure 2 | Ae. tauschii gene families and transcription factors. 

a, Distribution of orthologous gene families in Ae. tauschii, Brachypodium, 
sorghum, rice and barley. The number of gene families is represented in each 
intersection of the Venn diagram. The first number under the species name 
indicates the total number of genes annotated for a particular species, and the 


as the CCAAT-binding factor (CBF) transcription factors'®, late- 
embryogenesis-abundant proteins (LEA) and osmoprotectant biosyn- 
thesis proteins (Supplementary Information) as queries, we identified 
216 cold-related genes in the Ae. tauschii genome, in contrast to 164 
genes in Brachypodium, 132 in rice, 159 in sorghum and 148 in maize. 
Some of these genes were specific to Ae. tauschii or to Pooideae, includ- 
ing those encoding ice-recrystallization inhibition protein 1 precursor, 
DREB2 transcription factor o isoform and cold-responsive LEA/RAB- 
related COR protein. Expression analysis of RNA-Seq data showed 
that most of these Ae. tauschii-specific and Pooideae-specific genes 
were constitutively expressed in Ae. tauschii (Supplementary Fig. 23). 
In addition, 1,489 transcription factors (TFs) in 56 families were iden- 
tified by using Pfam DNA-binding domains (Supplementary Informa- 
tion). Ae. tauschii had an excess of such TFs as MYB-related genes 
(103, in contrast with 66 in Brachypodium and 95 in maize), and these 
are also thought to be involved in various stress responses'”. The M-type 
MADS-box genes (58, in contrast with 23 in Brachypodium and 34 
in maize) are involved in regulation in plant reproduction’® (Fig. 2b 
and Supplementary Table 18). ARACNe”’ co-expression analysis using 
RNA-Seq data predicted an expression network of 1,283 interactions 
(Supplementary Fig. 25), in which 13 TFs were associated with the 
expression of drought tolerance genes”’ (Supplementary Table 20). 

We predicted a total of 159 (133 families) previously undescribed 
microRNAs (Supplementary Information), and identified segmental 
and tandem duplications in 42 members of the miR2118 family 
that were organized into two groups on 15 scaffolds (Supplementary 
Fig. 26). The miR399 family, which is involved in the regulation of 
inorganic phosphate homeostasis in rice’, was expanded (20 members 
in Ae. tauschii, compared with 11 in rice and 10 in maize), and may 
contribute to the ability of Ae. tauschii to grow in low-nutrient soils. 
The expansion of the miR2275 family (eight members in Ae. tauschii, 
compared with two in rice and four in maize) may contribute to 
the enhanced disease resistance of Ae. tauschii because phased short 
interfering RNAs initiated by miR2275 have been implicated in 
these activities”. 

The Ae. tauschii genome served as the source for many grain qua- 
lity genes in hexaploid wheat, creating a step improvement in the form- 
ation of the elastic dough essential for bread making’. Grain quality 
genes include high-molecular-weight glutenin subunits (HMW-GS), 
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second indicates the number of genes in groups for that organism. The 
difference between the two accounts for singleton genes that were not present in 
any cluster. b, The composition of transcription factors (TFs) in Ae. tauschii 
and Brachypodium composed of more than 30 members. 


low-molecular-weight glutenin subunits (LMW-GS)”, grain texture 
proteins (GSP; puroindolines)* and storage protein activator (SPA). 
We identified two HMW-GS genes, five LMW-GS genes, one Pina gene, 
two Pinb genes, one GSP gene and one SPA gene in the Ae. tauschii 
genome sequence (Supplementary Information). As has been shown 
for the Hardness (Ha) locus”, the GSP, Pina and Pinb genes were 
also organized in a cluster. RNA-Seq analysis showed that these grain 
quality genes were expressed predominantly in seeds (Supplemen- 
tary Fig. 29). 

The anchoring of more than 40% of the scaffold sequences to four 
genetic maps and to syntenic regions of other sequenced grass species 
provided a structural framework for integrating multiple maps by 
using shared markers (Fig. 1 and Supplementary Information). The 
co-localization of genes in scaffolds and genetically mapped quanti- 
tative trait loci (QTLs) will directly support map-based gene cloning. 
On chromosome 2D, for example, the locations of 33 QTLs or genes 
were integrated with scaffold information (http://ccg.murdoch.edu.au/ 
cmap/ccg-live/) (Fig. 3 and Supplementary Information). Alignment 
of the Ae. tauschii genetic map with the wheat 2D consensus genetic 
map was unambiguous, with the exception of some single crossovers 
that were probably due to repetitive elements (dotted lines in Fig. 3). 
The genome sequence also provided the basis for the identification 
of more than 860,126 simple sequence repeats (SSRs), with trimers 
(37.7%) and tetramers (27.5%) as the most abundant SSR types (Sup- 
plementary Information). Together with the 711,907 SNPs identified 
by resequencing a roughly fivefold coverage of a second accession, 
Y2280 (Supplementary Information), the genomic resources reported 
here will promote map-based gene cloning and marker-assisted selec- 
tion in wheat. 

With its high base accuracy and nearly complete set of gene 
sequences, the Ae. tauschii draft genome sequence provides an essen- 
tial reference for studying D genome diversity by re-sequencing addi- 
tional accessions. Over the past half century, the introduction of new 
D genome diversity into synthetic wheat has been a major effort to 
expand bread wheat genetic diversity and to create environmentally 
resilient lines**’’. The Ae. tauschii genome sequence should aid in 
identifying new elite alleles for agriculturally important traits to 
alleviate the worsening plight of global climate and environment 
changes’. 
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Figure 3 | An integrated genetic map of Ae. tauschii chromosome 2D. The 
Ae. tauschii genetic map was integrated with markers, scaffolds and mapped 
QTLs to assist in marker development and map-based cloning. Left: the 

Ae. tauschii molecular map used for synteny alignment in Fig. 1 was aligned to 
chromosome 2D (November 2011 consensus map, CMap; http:// 
ccg.murdoch.edu.au/cmap/ccg-live) where sequence information was available. 
The original marker at a location is retained in CMap as a synonym. Right: 
within CMap, details for QTL locations are provided at a greater magnification 
to show all the markers in the regions of interest. The dotted lines indicate an 
ambiguous relationship that is most probably due to repetitive sequences. 


METHODS SUMMARY 


We selected Ae. tauschii (2n = 14) accession AL8/78 for sequencing. Plants were 
grown at 25 °C ina darkened chamber for two weeks; DNA was extracted from leaf 
tissue and purified with a standard phenol/chloroform extraction protocol. 
Sequencing libraries were constructed and sequenced on Illumina next-generation 
sequencing platforms (GAII and HiSequation (2000)). High-quality reads were 
assembled with SOAPdenovo’. Repeat sequences were identified by combining de 
novo approaches and sequence similarity at the nucleotide and protein levels. Gene 
models were predicted by combining homology-based, de novo and RNA-Seq- 
based methods. RNA-Seq reads were assembled with CAP3 (ref. 28) and CD-Hit” 
and were mapped to the draft genome with Tophat*. See Supplementary 
Information for details and additional analyses. 
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The emergence of functional microcircuits in visual 


cortex 


Ho Ko!*, Lee Cossell!*, Chiara Baragli', Jan Antolik'+, Claudia Clopath?, Sonja B. Hofer'* & Thomas D. Mrsic-Flogel'* 


Sensory processing occurs in neocortical microcircuits in which 
synaptic connectivity is highly structured’* and excitatory neu- 
rons form subnetworks that process related sensory information”*. 
However, the developmental mechanisms underlying the forma- 
tion of functionally organized connectivity in cortical microcir- 
cuits remain unknown. Here we directly relate patterns of 
excitatory synaptic connectivity to visual response properties of 
neighbouring layer 2/3 pyramidal neurons in mouse visual cortex 
at different postnatal ages, using two-photon calcium imaging in 
vivo and multiple whole-cell recordings in vitro. Although neural 
responses were already highly selective for visual stimuli at eye 
opening, neurons responding to similar visual features were not 
yet preferentially connected, indicating that the emergence of fea- 
ture selectivity does not depend on the precise arrangement of local 
synaptic connections. After eye opening, local connectivity reorga- 
nized extensively: more connections formed selectively between 
neurons with similar visual responses and connections were elimi- 
nated between visually unresponsive neurons, but the overall con- 
nectivity rate did not change. We propose a sequential model of 
cortical microcircuit development based on activity-dependent 
mechanisms of plasticity whereby neurons first acquire feature 
preference by selecting feedforward inputs before the onset of sen- 
sory experience—a process that may be facilitated by early elec- 
trical coupling between neuronal subsets’ °—and then patterned 
input drives the formation of functional subnetworks through a 
redistribution of recurrent synaptic connections. 

Intrinsic and experiential factors guide the patterning of neural 
pathways and the establishment of sensory response properties during 
postnatal development’®’. During this time, neural circuit refinement 
is thought to depend on the elimination of initially exuberant projec- 
tions, selective formation of new connections or both'*'*. However, the 
mechanisms governing the emergence of structured connectivity in 
local cortical microcircuits, where dendrites and axons overlap exten- 
sively, remain uncertain. Moreover, it is not clear whether the organi- 
zation of synaptic connections between nearby neurons is established 
early and inherently linked to the formation of receptive fields (RFs) 
before the onset of sensory experience or whether the mature patterns 
of recurrent connectivity appear only after the formation of RFs as a 
result of correlated activity induced by feedforward drive from the 
sensory periphery. Here we investigate these questions in networks 
of layer 2/3 (L2/3) pyramidal cells in mouse primary visual cortex 
(V1)—where neighbouring neurons exhibit a diversity of preference 
for visual features”'*'’—by determining how local synaptic connec- 
tivity relates to visual response properties during development. 

Stimulus-selective responses in V1 are observed at eye opening 
but the extent to which detailed RFs are established by this time 
remains unclear. To characterize the spatial RF structure of L2/3 neu- 
rons in V1 at eye opening (postnatal days (P) 14-15) and in more 
mature mice (P28-35), we used in vivo two-photon calcium imaging” 


18-22 
> 


in monocular V1 to obtain spatial RFs by reverse correlation'®'”** of 


static natural images and spiking responses inferred from calcium 
signals*> (Methods and Fig. la, b), and fitted a two-dimensional 
Gabor function to the RFs (Supplementary Fig. 1). At eye opening, 
L2/3 neurons exhibited a diversity of RF spatial structures that 
resembled those of mature V1 neurons (Fig. 1c and Supplementary 
Fig. 1a). The proportion of neurons with significant linear RFs was 
similar between the two age groups (P14-15, 60%, 191 of 317; P28-35, 
58%, 201 of 348; P=0.51, ~ test; Fig. 1d), as was the angle of visual 
space subtended by RFs (mean visual angle along the long RF axis + 
s.d.: P14-15, 29.3 + 13.6°; P28-35, 29.4 + 10.3°; P= 0.12, rank-sum 
test; Fig. le; see also Supplementary Fig. 1b, c). The similarity of RF 
structures was shown by the overlapping distributions of standard RF 
measures of ,.and n,, which respectively express the width and length 
of the fitted Gabor function in units of the underlying grating period 
(median n,: P14—-15, 0.31; P28-35, 0.32; P = 0.14; median ny: P14-15, 
0.20; P28-35, 0.20; P=0.41; rank-sum test; Methods and Sup- 
plementary Fig. 1d). For comparison, the orientation tuning of neu- 
rons responsive to drifting gratings was slightly but significantly 
broader at eye opening than in more mature V1 (median orientation 
selectivity index: P13-15, 0.62; P22-26, 0.68; P = 2.39 X 10 *4, rank- 
sum test; Supplementary Fig. 2a, b). Neuronal responses at eye open- 
ing, however, exhibited higher variability to repeated presentation of 
the preferred grating stimulus (coefficient of variation (CV)) than in 
more mature animals (mean CV = s.d.: P13-15, 0.93 + 0.36; P22-26, 
0.71 + 0.30; P = 1.17 X 10°74, rank-sum test; Supplementary Fig. 2c), 
indicating that responses in immature animals were less reliable des- 
pite the presence of clearly defined RFs and orientation selectivity. 
The similarity of spatial RF properties in immature and older V1 
suggests that the organization of feedforward connections” was largely 
established by the time of eye opening. We next tested whether the 
synaptic connectivity of neurons in the local cortical network is func- 
tionally specific when vision begins. We combined in vivo two-photon 
calcium imaging in V1 and subsequent multiple whole-cell recordings 
in slices of the same tissue® (Fig. 2). We first imaged calcium signals at 
consecutive depths within L2/3 to characterize the responses to natural 
movies and drifting gratings (Methods) of all neurons within a volume 
of approximately 285 um X 285 um X 40-120 um. We then carried 
out simultaneous whole-cell recordings in vitro from two to six neigh- 
bouring L2/3 pyramidal neurons separated by less than 50 um (mean 
distance + s.d.: P13-15, 24 + 9 um; P22-26, 25 + 10 1m; Fig. 2a, b). 
We recorded from 143 and 140 neurons in total in the slices at P13-15 
and P22-26, respectively, which were identified in the in vivo image 
stack by image registration based on affine transformation’ (Fig. 2a). 
Synaptic connectivity was assessed by evoking action potentials in each 
neuron sequentially while searching for the presence of excitatory 
postsynaptic potentials in the other neurons (Fig. 2b). This approach 
allowed us to relate the probability of finding connections between 
pairs of L2/3 neurons to the correlation of their average responses to 
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Figure 1 | Responses of L2/3 pyramidal cells in mouse visual cortex are 
highly feature selective at eye opening. a, Example of OGB-labelled region at 
P14 (left; scale bar, 30 um) with calcium transients of two cells obtained with 
two-photon microscopy (bottom; scale bars, 20s, 10% AF/F) in response to 
natural image sequences. b, Linear receptive fields (RFs) of the neurons in 


natural movies (signal correlation; Fig. 2c, d, f, h) and to the differences 
in their preferred orientation (Fig. 2g, i). We used natural movie signal 
correlation for comparison of response similarity not only because it 
was a good predictor of the similarity of the neurons’ linear RFs 
(Supplementary Fig. 3), but because it also probably captures the simi- 
larity of feature selectivity in neurons with nonlinear RFs, which could 
not be estimated by reverse correlation. 

The overall rate of connectivity was not significantly different 
between the two age groups (P13-15, 16.4%, 58 of 353 connections 
tested; P22-26, 21.7%, 64 of 295 connections tested; P = 0.09, a test; 
Fig. 2e). Among the neurons recorded in vitro, 73% (104 of 143) and 
56% (79 of 140) exhibited significant responses to the natural movie 
in vivo (Methods). As we reported previously’, among responsive 
L2/3 pyramidal cells the connection probability increased steeply with 
increasing signal correlation at P22-26 (P= 4.6 X 10 *, Cochran- 
Armitage test; Fig. 2f). This was not the case at P13-15 (P = 0.092), 
when a much weaker trend was observed. Specifically, there were twice 
as many connections between highly correlated neuronal pairs (signal 
correlation, =0.1) in older V1 than at eye opening (P13-15, 19.4%, 
19 of 98 tested; P22-26, 41.5%, 22 of 53 tested; P = 0.0035, va test; 
Fig. 2f). Therefore, the functional selectivity of synaptic connections 
increased in the period after eye opening, as more connections formed 
selectively between neurons responding to similar stimulus features. 

We further examined the refinement of connection specificity by 
relating the connection rate between reliably responsive, orientation- 
tuned neurons (P13-15, 43.4%, 62 of 143 neurons; P22-26, 57.9%, 81 
of 140; Fig. 2g and Methods) to the difference in their preferred ori- 
entation. A significant decreasing relationship between connection 
probability and the difference in preferred orientation was present in 
more mature V1 but not at eye opening (P13-15, P = 0.27; P22-26, 
P= 0.034; Cochran—Armitage test; Fig. 2g). Together, these results 
indicate that at eye-opening the organization of synaptic connections 
between nearby L2/3 pyramidal neurons exhibits only weak functional 
specificity. After the onset of visual experience, connectivity increases 
specifically between neurons coding for similar visual features. 

Previous studies suggest that bidirectional recurrent connections 
are overexpressed in some cortical networks” and that they are most 
frequent between L2/3 pyramidal cell pairs with similar visual res- 
ponses in mature V1°. We examined whether a similar organization 
of bidirectional motifs is already present at eye-opening. In contrast to 
mature mice, visually naive mice did not exhibit a larger proportion of 
bidirectionally connected pairs between neurons with highly corre- 
lated responses to the natural movie (P13-15, P= 0.27; P22-26, 
P= 0.01; Cochran-Armitage test; Fig. 2h). Similar trends were found 
between neurons preferring similar orientations (P13-15, P = 0.13; 
P22-26, P=0.11; Cochran—Armitage test; Fig. 2i). Therefore, this 
statistical feature of pairwise connectivity also refines after eye opening, 
such that a greater proportion of neurons with similar visual responses 
become bidirectionally connected (Fig. 2h, i). 


a obtained by regularized reverse correlation (Methods). Scale bars, 20°. c, RFs 
of neurons from two mice at different ages. a.u., arbitrary units; red indicates 
ON subfield; blue indicates OFF subfield. d, e, Fraction of neurons with 
significant RFs (d, 7’ test) and RE size (e, rank-sum test) at eye opening and in 
more mature V1. Error bars, s.d.; P14-15, n = 4 mice; P28-35, n = 5 mice. 


We next tested for developmental changes in recurrent connectivity 
between neurons not reliably responsive to visual stimuli, which were 
encountered in similar proportions in both age groups (fraction of 
neurons not significantly responsive to repeated presentations of the 
natural movie: P13-15, 39%, 4,133 of 10,509; P22-26, 44%, 4,691 of 
10,662). At P22-26, non-responsive neurons connected to each other 
at much lower rates than responsive neurons (P < 0.01, Tukey’s HSD 
multiple-comparison test among proportions; Fig. 3a). At P13-15, 
however, responsive and non-responsive neurons formed recurrent 
connections at similar rates (P > 0.05; Fig. 3a). These data suggest 
that connections between L2/3 neurons not driven effectively by visual 
stimuli are selectively eliminated after eye opening (P < 0.01; Fig. 3a). 
This nonspecific-connection scheme at eye opening is consistent with 
the statistics of population activity during visual stimulation, which 
showed a twofold-higher total pairwise firing rate correlation at P13- 
15 than at P22-—26 (median correlation: P13-15, 0.044; P22-26, 0.021; 
P<10 °°’, rank-sum test; Fig. 3b). Higher activity correlations at eye 
opening may allow connections to be maintained between neurons not 
reliably driven by visual stimuli. These connections are then lost as 
activity in the V1 network becomes progressively less correlated. 

To obtain a mechanistic insight into the refinement of local recur- 
rent connectivity after eye opening, we constructed a network model of 
the neocortex based on activity-dependent synaptic plasticity’’. The 
model consisted of a recurrently connected cortical network of 18 excit- 
atory and 5 inhibitory integrate-and-fire neurons (Methods). Cortical 
neurons received feedforward input from 500 presynaptic neurons, a 
subset of which exhibited spatially correlated activity during each 
iteration of the simulation (Fig. 4a). The weights of both recurrent 
and feedforward synapses were updated by a voltage-based spike- 
timing-dependent plasticity (vSTDP) learning rule’’. Initially, neurons 
were seeded with RFs (Methods) and the weights of the recurrent 
network were drawn randomly from a uniform distribution (Fig. 4b, 
upper panels). Excitatory neurons in the recurrent network with the 
same RFs developed strong, mostly bidirectional connections (93.2%; 
proportions taken across 50 simulations; Fig. 4b, c). There was a strong 
decrease in connectivity between neurons that were not responsive to 
feedforward input (from 20.5% near the beginning of the simulation to 
0.6% at the end; Fig. 4d) but not between neurons that were both 
responsive to feedforward input (decrease from 26.0% to 20.7%), con- 
sistent with experimental observations (Fig. 3a). To compare the 
model’s behaviour further with our experimental data, we froze the 
feedforward and recurrent weights at three time points and measured 
the signal correlation between all responsive neuronal pairs. Higher 
signal correlations between neurons indicated a higher rate of connec- 
tivity at later but not earlier stages of network development (Figs 4e and 
2f). The model also predicted the increase in bidirectional connec- 
tions between neurons with high signal correlations at later stages of 
development (Figs 4e and 2h). These simulations suggest that feed- 
forward connection patterns determine the structure of recurrent con- 
nectivity by activity-dependent mechanisms of synaptic plasticity. 
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Figure 2 | Functionally specific connectivity between L2/3 pyramidal cells is 
not apparent at eye opening. a, Example triplet of neurons shown in a 
transformed in vivo image (left), in the brain slice (middle) and during whole- 
cell recordings (right). Scale bar, 30 tm. b, Membrane potential recordings 
from neurons shown in a. Evoked spikes and average traces of postsynaptic 
potentials. Dashed lines indicate timing of presynaptic spikes. Vertical scale 
bars, 80 mV for traces along the diagonal, 0.8 mV otherwise. c, Peristimulus 
time histogram of spikes inferred from calcium signals (arbitrary units) of the 
three neurons in response to a natural movie sequence (averages of six 
repetitions). d, Schematics of synaptic connectivity and in vivo signal 
correlations during natural movies for the three neurons in a. e, Overall 
connectivity rates at eye opening and in more mature V1; y’ test. f, Relationship 
between connection probability and signal correlation of neuronal pairs 
significantly responsive to the natural movie across age; Cochran—Armitage 
test. g, Relationship between connection probability and difference in preferred 
orientation (AOri) among pairs in which both neurons were responsive and 
orientation selective (orientation selectivity index >0.4).h, i, The probability of 
observing uni- or bidirectionally connected pairs as a function of either signal 
correlation (h) or AOri (i). P13-15, n = 13 mice; P22-26, n = 18 mice. 


Our results indicate that RFs exist before mature patterns of recur- 
rent connectivity. However, transient electrical coupling via gap junc- 
tions between clonally related neurons contributes to shared feature 
selectivity and raises the possibility that intracortical connectivity may 
precede and instruct RF formation”’”*. We extended our network 
model earlier in time to examine the mechanisms by which gap- 
junction coupling may influence the emergence of RFs and recurrent 
connectivity. In this model, early recurrent connectivity in the cortical 
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Figure 3 | Developmental elimination of recurrent connections between 
non-responsive neurons. a, Connection probability between neurons 
significantly responsive to the natural movie (R— R) and between non- 
responsive neurons (N—> N) at two ages. **P < 0.01, Tukey’s HSD multiple- 
comparison test among proportions; NS, not significant. b, Distribution of 
pairwise time-varying inferred firing rate correlation coefficients for all 
responsive cell pairs (to natural movies) separated by <50 um. ***P< 10 °°”, 
rank-sum test. 


network was provided by electrical gap junctions (Fig. 4f, g) and 
recurrent excitatory chemical synapses did not exist, approximating 
the organization of the mouse neocortex in the first postnatal week’. 
Early feedforward connections were randomly assigned (Fig. 4f, g) and 
their weights were updated according to the vSTDP rule. Cell pairs 
were more likely to stabilize the same set of feedforward inputs (that is, 
develop the same RF) if they had been connected by gap junctions 
(gap-junction coupled, 31.6%, 79 of 250 pairs; not gap-junction 
coupled, 4.1%, 305 of 7,400 pairs; P< 0.001, 7’ test; 50 simulations; 
Fig. 4h). We then removed the gap junctions, which disappear by the 
second postnatal week in mouse V1’, and assigned random recurrent 
synaptic connections to neurons in the cortical network (Fig. 4f, g). 
Neurons sharing the same RF formed strong synaptic connections 
(Fig. 4c, g). Therefore, the functional specificity of recurrent connec- 
tions was influenced indirectly by early gap-junction motifs (Fig. 4i), as 
electrically coupled neurons were first likely to develop the same RFs 
before becoming synaptically connected. 

Separate simulations initiated with chemical connections revealed 
that early modifiable bidirectional chemical connections had no influ- 
ence on either the formation of RFs or recurrent connectivity at the end 
of the simulation (probability of developing same RF: not connected, 
5.5%, 270 of 4,918 pairs; bidirectional chemical connections, 3.8%, 11 
of 288 pairs; P = 0.22, ¢ test; 50 simulations; Fig. 4h, i). Therefore, 
early initial biases in cortical connectivity may only influence func- 
tional circuit development if they exist as strong and stable connections 
when feedforward inputs are being selected. Although the absolute 
connectivity rates found experimentally were not perfectly replicated 
in the model (compare Fig. 2f with Fig. 4e), the connection probability 
between cells with similar visual responses was higher when assessed. 
deeper in the acute slice, where connections are more likely to be 
preserved during cutting (supplementary fig. 6 of ref. 5). 

We found that the functional specificity of local connections in 
mouse V1 was not apparent at eye opening despite the occurrence 
of highly selective responses to visual features. Although the overall 
rate of synaptic connectivity did not change after eye-opening, con- 
nections redistributed according to the following rules: more connec- 
tions were added preferentially between neurons responding most 
similarly to visual stimuli, whereas connections were eliminated 
between cells not reliably responsive to visual stimulation. This result 
is surprising given existing theories of neural circuit formation, which 
suggest either that connections are initially exuberant and are subse- 
quently ‘pruned’ or that synapse number increases after the onset 
of sensory experience’’"™*. Instead, in local L2/3 networks we find a 
balanced restructuring of connectivity after eye opening. It remains 


©2013 Macmillan Publishers Limited. All rights reserved 


LETTER 


a Goitiesl b Feedforward Recurrent Only Gus; g Recurrent h 
ortica connections ; z connections 
network model no chemical synapses 2 0.4 
i 4 2 
: u 
6 0.3 
: 0 : 2 
rT > ne} 
> E = 10 = 
= = I a 5 3 0.2 
ae A7IAS S 12 ie ii - be 
eee ae 2 6 ee 16 B04 
% peg S : o 7 
=f i 18 2 
vstor | © of 2 c 0 
ie : 88 
ge: 3 oe 3 fo) 
10 = 2 é Start connection 
12 4 8 
=] 

5 ZAAEN ie Buea Ree 2 ORO Bidirectional synapse 
RTT 18 2 Ss ai 
ee A 2 6 10 14 18 1) CaO Ganknaion 

Random chemical @ 
A Feedforward input Presynaptic neuron : e 
Feedforward input synapses; GJ removed @ i 0.6 - 
ae BB Bidirectional synapse a 2 : Unidirectional 
Synaptic U1 unidirectional synapse S x a 2 5 0Sf Il Bidirectional 
weight DI Gap junction c g 4 5 
ae 2 0.45 
Cc d e ; ; > 2 g & 
1 >04 > 1 Signal correlation § .n E 3 oat 
2 7 5 7 (Unidirectional as i 1 BE 
97 08 o @ 0.87 Ml Bidirectiona ‘ - 4 = 2 @02F 
ae 2 03 2 Functionally specific : Ex 
g 5 0.6 5 5 connectivity develops 7 t & oat 
c ° 5 0.2 5 (3) 1 
8 € 0.4 3 8 a x o: 0 
28 02 eo : 5 8 : O 
§ 3 fe fo} a) = at 
8 3 (6) 2 ; ie) 
s) os 0 gre APTN 
FOS Dod Bd s Ld 5 Te ee 2: G10 14 18 Start connection 
ww S as SP PP OPP = Raney Feedforward input Presynaptic neuron pe 
Early Intermediate Late 


Figure 4 | Feedforward input structure determines the functional 
organization of recurrent connectivity. a, Sketch of the network model of 
functional microcircuit development based on vSTDP (see text for details). At 
the start of the simulation, cortical neurons were randomly connected but 
received spatially clustered input from a subset of presynaptic neurons. Both 
feedforward and recurrent connections were updated according to the vSTDP 
rule (Methods). b, Synaptic weight matrices of feedforward (left, reordered for 
display purposes) and recurrent (right) connections from an example network 
at the beginning and end of the simulation. Recurrent synaptic connections 
were classified as weak (light grey), unidirectional (white) or bidirectional 
(black). ¢, Probability of observing weak, unidirectional or bidirectional 
connections, at the end of the simulation, between neurons that start with the 
same RF. d, Connection probability of responsive (R—> R) and non-responsive 
(N— N) neuronal pairs during and at the end of the simulation. e, Relationship 


to be seen whether similar mechanisms contribute to the elabora- 
tion of long-range connections in visual cortex of rodents and other 
mammals?*””?, 

Functionally specific connection patterns between L2/3 pyramidal 
cells seem to be instructed by feedforward input (for example from layer 4 
or the visual thalamus) only after RF formation. Our network model 
suggests that correlated firing driven by feedforward activity increases 
the functional specificity of recurrent connections by activity-dependent 
mechanisms of synaptic plasticity, which leads to the preferential 
formation of synapses between any neurons sharing similar RFs. The 
model can additionally explain how electrically coupled neurons early in 
development first develop similar feature selectivity and then preferential 
recurrent connectivity”””* (Fig. 4g-i). 

Our data suggest that functionally organized connectivity between 
L2/3 pyramidal neurons is not necessary for establishing elemental RF 
properties at eye opening. Instead, it may contribute to the amplifica- 
tion of visually driven responses and thereby increase the robustness 
and reliability of cortical representation of sensory input with age 
(Supplementary Fig. 2c), which may be facilitated by the preferential 
formation of bidirectional connections between neurons with similar 
stimulus preferences (Fig. 2h, i). The role of inhibition for the matura- 
tion of visual responsiveness remains to be determined. 

In conclusion, the patterning of recurrent cortical connectivity 
through the feedforward-driven, activity-dependent redistribution of 
connections may be a fundamental rule by which neurons link 
together into assemblies that process related information. 


between connection probability and feedforward input-driven signal 
correlation at three time points in the simulation. f, Sketch of different stages of 
the network model extended to earlier developmental times. GJ, gap junction. 
g, Synaptic weight matrices from the example gap-junction network model. 
The recurrent network is initially connected with gap junctions (yellow) in the 
absence of chemical synapses. With time, neurons selected a spatially clustered 
set of feedforward inputs (RFs). Gap junctions were then removed and 
recurrent chemical connections were randomly assigned. The simulation was 
then continued as in a and b. h, Probability of developing RFs from the same set 
of feedforward inputs for pairs with no recurrent connections, gap junctions or 
early bidirectional connections (data from separate simulations) at the start of 
the simulation. i, Probability of developing shared connections depended on 
the starting connectivity. Data in c-e, h and iare from 50 network simulations. 


METHODS SUMMARY 


Responses to different visual stimuli in anaesthetized mouse V1 L2/3 cells were 
measured using in vivo two-photon calcium imagining as previously described>”’. 
Receptive fields were obtained by reverse correlation using a regularized pseudo- 
inverse method”. Synaptic connections were assayed by in vitro whole-cell recor- 
dings of a subset of neurons imaged in vivo and re-identified as previously 
described*”°. The incidence, strength and short-term dynamics of synaptic con- 
nections were related to visual response properties across age (Figs 2 and 3 and 
Supplementary Fig. 4). The network model consisted of 23 recurrently connected 
integrate-and-fire type neurons (18 excitatory, 5 inhibitory) receiving 500 feedfor- 
ward inputs with neuronal dynamics and plasticity modelled as in ref. 27. 


Full Methods and any associated references are available in the online version of 
the paper. 
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METHODS 


Animals and surgical procedures. All experimental procedures were carried out 
in accordance with institutional animal welfare guidelines and licensed by the UK 
Home Office. Experiments were performed on C57Bl/6 mice aged P13-15 and 
P22-35. Mice were initially anaesthetized with a mixture of fentanyl (0.05 mg 
kg~'), midazolam (5.0 mgkg  '), and medetomidine (0.5 mgkg '). At the time 
of imaging, the injectable anaesthetic had mostly worn off and light anaesthesia 
was maintained by isoflurane (0.3-0.5%) in a 3:2 mixture of O2:N,O delivered via a 
small nose cone. Surgery was performed as described previously’’. Briefly, a small 
craniotomy (1-2 mm) was carried out over primary visual cortex and sealed after 
dye injection with 1.6% agarose in HEPES-buffered artificial cerebrospinal fluid 
(ACSF) and a cover slip. 

In vivo two-photon calcium imaging. For bulk loading of cortical neurons, the 
calcium-sensitive dye Oregon Green BAPTA-1 AM (OGB-1 AM, Molecular Probes) 
was first dissolved in 4 ul DMSO containing 20% Pluronic F-127 (Molecular Probes), 
and further diluted (1/11) in dye buffer (150 mM NaCl, 2.5mM KCl and 10mM 
HEPES, pH 7.4) to yield a final concentration of 0.9 mM. Sulphorhodamine 101 (SR 
101, 50 1M, Molecular Probes) was added to the solution to distinguish neurons and 
astrocytes*’. The dye was slowly pressure-injected into the monocular region of right 
visual cortex at a depth of 170-200 1m with a micropipette (3-5 MQ, 3-10 p.s.i., 
2-4 min) under visual control by two-photon imaging (10 water immersion objec- 
tive, Olympus). Activity of cortical neurons was monitored by imaging fluorescence 
changes with a custom-built microscope and a mode-locked Ti:sapphire laser (Mai 
Tai, Spectra-Physics) at a wavelength of 830 or 930 nm through a 40 water immer- 
sion objective (0.8 NA, Olympus). Scanning and image acquisition were controlled by 
custom software written in LABVIEW (National Instruments). 

Visual stimuli were generated using the MATLAB (Mathworks) Psychophysics 
Toolbox’, and were displayed on an LCD monitor (60-Hz refresh rate) posi- 
tioned 20cm from the left eye, roughly at 45° to the long axis of the animal, 
covering ~105° X 85° degrees of visual space. At the beginning of each experi- 
ment, the appropriate retinotopic position in visual cortex was determined using 
small grating stimuli at 12-24 neighbouring positions. The monitor was reposi- 
tioned such that the preferred retinotopic position of most imaged neurons was 
roughly in the middle of the monitor. 

Imaging frames of 256 X 256 pixels were acquired at 7.6 Hz while different 
visual stimuli, including naturalistic images and movies, and drifting gratings 
(see below) were presented. After each recording, the focal plane and imaging 
position was checked and realigned with the initial image if necessary. In com- 
bined in vivo functional imaging and in vitro connectivity assaying experiments, to 
obtain visually evoked responses from all neurons in a cortical volume of approxi- 
mately 285 um X 285 tum X 40-120 jum, images were recorded at 7 to 18 cortical 
depths with a spacing of 7 tm, starting at ~110 jum below cortical surface, a depth 
that corresponds to superficial L2/3 in mouse V1. 

Image sequences were aligned for tangential drift and analysed with custom 
programs written in MATLAB and LABVIEW. Recordings with significant brain 
movements, vertical drift or both were excluded from further analysis. Cell out- 
lines were detected using a semi-automated algorithm based on morphological 
measurements of cell intensity, size and shape, and were subsequently confirmed 
by visual inspection. After erosion of the cell-based regions of interest (to minimize 
influence of the neuropil signal around the cell bodies), all pixels within each 
region of interest were averaged to give a single time course (AF/F), which was 
additionally high-pass-filtered at a cut-off frequency of 0.02 Hz to remove slow 
fluctuations in the signal. Spike trains were inferred from calcium signals using a 
fast non-negative deconvolution method which approximates the maximum 
a-posteriori spike train for each neuron, given the fluorescence observations”. 
This method yields spike probabilities (or inferred firing rate) linearly related to 
the number of action potentials per imaging frame”. 
Receptive field measurement. Receptive field data were acquired from four mice 
at eye opening (P14-15) and five mature mice (P28-35). Naturalistic image 
sequences (between 1,440 and 2,700 individual images) were presented on the 
monitor during two-photon calcium imaging. Images were shown at 2-s intervals 
(0.5-s presentation time, interleaved by a 1.5-s grey screen) for a total presentation 
time of between 0.83 and 1.5h. After the onset of each natural image, 15 imaging 
frames were recorded at 7.6 Hz before the next image was presented. For each cell 
in the imaged region, the response to an image was calculated in the following way. 
Spike probabilities were inferred from calcium signals using the fast non-negative 
deconvolution method described above. For each visual stimulus, k = 1, ..., N, and 
each cell, i= 1,...,C, the response to the stimulus can be expressed as r(k, i, j) 
where j = 1,...,15 are the 15 imaging frames. An average population response 
was calculated: R(j) = >°>xir(k,i,j)/NC. If the imaging frame, J, denotes the 
frame during which the peak average population response occurred (such that 
R(J) = max;{R(j)}), then the response of cell i to stimulus k was defined as 

J+1 i 

jay —itlks i, j)/3. 


LETTER 


To estimate linear RFs, a regularized pseudo-inverse method” was used for 
reverse correlating neuronal responses with images of natural scenes. This algo- 
rithm regularizes the inverse problem by introducing a two-dimensional smooth- 
ness constraint on the linear RF; namely, the constraint is that the Laplacian of the 
RF should be close to zero at all points (V7RE = 0). This method introduces a 
regularization parameter, A, which balances the emphasis to be placed on fitting 
the data and the emphasis to be placed on the smoothness constraint. 

Because this method introduces this free parameter, we performed the following 
analysis to choose the regularization parameter. For each cell and each regulariza- 
tion parameter, the naturalistic images and associated responses were separated 
into training (75% of the data) and test (remaining 25% of the data) data sets. 
Training data sets were chosen randomly and the remaining 25% of the data was 
placed into the test data set. Linear RFs were then calculated using the training 
data, and a sigmoid nonlinearity, described by the equation 


A 


i 1+ exp(—ax+ f) 


(where A is the amplitude, « determines the slope and /} determines the offset of 
the sigmoid), was then fitted to the training data to convert the linear predictions 
made by the RF into neuronal spike probabilities. Response predictions to the 
naturalistic images of the test data set were then made and the correlation coef- 
ficient between the actual and predicted responses was taken as a measure of RF 
prediction performance. This procedure was carried out for each cell and each 
regularization parameter 100 times. For each cell, the regularization parameter 
that maximized the RF prediction performance was chosen. 

To assess whether the RF for a particular cell was significant, the response vector 
to the naturalistic image sequence was randomly shuffled and the reverse correla- 
tion was performed again, using the same regularization parameter, 4. This pro- 
cedure was repeated 99 times to produce 100 shuffled RFs, RFghutfea- From these 
shuffled RFs, the mean, fshuffea and standard deviation, O.pufeas across all pixels 
were calculated. An RF was defined to be significant if there were pixels that had 
absolute values greater than [lghuffted + 60 shuffled: 

To fit the Gabor function, we used only the RFs determined to be significant by 
the previous analysis. The RF was parameterized in MATLAB by fitting a two- 
dimensional Gabor function using the Levenberg-Marquardt algorithm. The 
Gabor function is described by 


2 2 
G(x,y) =A exp ( a t cos (27fx' +9) 
x y 


where 


x’ = (x —c,) cos (0) —(y—cy) sin (0) 
y' =(x—cx) sin (0) +(y—c,) cos (0) 


These equations describe an underlying two-dimensional cosine grating parame- 
terized by 0 (orientation), f (spatial frequency) and ¢ (phase), which is enveloped 
by a two-dimensional Gaussian function parameterized by A (amplitude), c, and 
cy (centre of the Gaussian), and o,, and a, (standard deviations of the Gaussian 
perpendicular to and parallel to the axis of the grating, respectively). Gabor fits 
were individually inspected to make sure they matched the RF (some Gabor fits 
were excluded at this point because they did not provide a good match to the RF: 
P14-15, 3 of 191 (1.6%); P28-35, 6 of 201 (3%)). 

To quantify the shapes of RFs, the dimensionless measures n, = o,fand ny = o,f 
were used*°. These values express the size of the Gaussian envelope in terms of the 
wavelength of the underlying cosine grating. For instance, n, = 1 indicates that the 
standard deviation of the Gaussian perpendicular to the grating is equal to half a 
cycle of the underlying cosine grating. To get a measure of the size of the RF (Fig. 1 
and Supplementary Fig. 1), the visual angle subtended by the Gabor fit along the 
axis perpendicular to the direction of the cosine grating was calculated. That is, if 
the eye of the mouse is at (0, 0, 0) in space, the centre of the monitor is at (0, 0, d) 
(where d is the shortest distance to the mouse eye from the screen), the centre of 
the Gabor fit to the RF is at (c,, Cy d), and the angle of orientation of the cosine 
grating on the screen is 0, then the visual angle, x, subtended by the Gabor was 


calculated as 
( UsV ) 
a&= arccos{ ———— 
\|u|| |IvI| 


where u = (c, — o,cos(9), c, + o,sin(0), d) and v = (c, + ¢,cos(0), cy — o,sin(0), 
d). Receptive field similarity was calculated as the pixel-pixel correlation 
coefficient. 

Natural movie signal correlation. Natural movies consisted of 40-s sequences of 
either moving scenes in a mouse cage or compilations of The Life of Mammals 
(BBC), adjusted to 70% mean contrast, continuously looped six times. Visual 
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responsiveness to natural movies was determined by the following procedure. For 
all stimulus repetitions, inferred spike trains were moving-average-filtered with a 
time window of three frames (~0.394 s). The smoothed firing rates at correspond- 
ing points of the stimulus were then treated as groups and tested for differences by 
one-way analysis of variance. Neurons with a P value less than 0.01 (that is, those 
that exhibited consistently increased firing during at least one period of stimulus 
presentation) were considered significantly visually responsive. For pairs of sig- 
nificantly responsive neurons, the signal correlation was calculated as the 
Pearson’s correlation coefficient of the averaged responses to the stimulus. 
Orientation tuning. To measure the orientation preference and selectivity of 
neurons, square-wave gratings (0.035 cycles per degree, 2 cycles per second, 
100% contrast) drifting in eight different directions were randomly interleaved, 
with the grating standing still for 1.4-1.9 s before moving for 0.9-1.5 s (six repeti- 
tions per grating). Responsive neurons that exhibited consistently increased firing 
during at least one time point of presentation of each grating were identified by 
one-way analysis of variance. Among cells responsive to grating stimuli (P < 0.05), 
the mean inferred firing rate during the presentation ofa drifting grating was taken 
as the response to each stimulus. From each trial, we obtained one orientation 
tuning curve, and neurons were defined as reliably responsive if the mean cross- 
correlation between all pairs of curves obtained from different trials was greater 
than 0.1. Responses from different trials were then averaged to obtain the average 
orientation tuning curve for each neuron. This orientation tuning curve was then 
Fourier interpolated to 360 points, and the preferred direction was determined by 
the angle at which the interpolated tuning curve attained its maximum. The 
preferred orientation was taken as preferred direction modulo 180°. Orientation 
selectivity index (OSI) was calculated as (Rest — Rortho)/(Rbest + Rortho)» Where 
Rbest is the interpolated response to the preferred direction and Rortho is the average 
of interpolated responses to the directions orthogonal to the best-responding 
direction. When relating connection probability to orientation selectivity, neurons 
were defined as orientation selective if OSI > 0.4. To quantify neuronal response 
reliability, we calculated the coefficient of variation (s.d. divided by mean) from 
responses to the optimal grating direction. 

In vitro whole-cell recording. We carried out imaging experiments followed by 
whole-cell recordings in vitro at P13-15 and P22-26, using an approach described 
previously’. After two-photon calcium imaging of visual responses in vivo, small 
volumes of red fluorescent microspheres (Lumafluor) were injected into the 
imaged region to facilitate identification of the region in the slice tissue. The mouse 
brain was then rapidly removed to and dissected in ice-cold ACSF containing 
125mM NaCl, 2.5mM KCl, 1mM MgCh, 1.25mM NaH2PO,, 2mM CaCh, 
26mM NaHCO;, 25mM dextrose; 315-325 mOsm, bubbled with 95% O,/5% 
CO, pH7.4. Visual cortex slices (300|1m) were cut coronally (HM 650V 
Vibration Microtome, MICROM) and were incubated at 34 °C for 30 min before 
they were transferred to the recording chamber. The slice containing the imaged 
region was identified by the presence of OGB-1 green fluorescence and the red 
microsphere injection site. To reveal the relative locations of cells, a detailed 
morphological stack of the slice was acquired with a custom-built microscope 
and a mode-locked Ti:sapphire laser (Chameleon, Coherent), at a wavelength of 
830 nm, through a X 16 water immersion objective (0.8 NA, Nikon). Scanning and 
image acquisition were controlled by custom software written in LABVIEW. 

Whole-cell recordings from two to six cells were carried out in regions identified 
by visually comparing image stacks obtained in vivo and in vitro, using red fluor- 
escent microspheres and the pial surface as reference. Recordings were carried out 
in ACSF at 28 °C, using Multiclamp 700B amplifiers (Axon Instruments), and data 
was acquired using custom software running in IGOR PRO” (WaveMetrics Inc.) 
or MATLAB. Recording pipettes were filled with internal solution containing 
5mM KCl, 115mM K-gluconate, 10mM K-HEPES, 4mM MgATP, 0.3mM 
NaGTP, 10mM Na-phosphocreatine, 0.1% w/v biocytin, 40 uM Alexa Fluor 
594; 290-295 mOsm, pH 7.2. Junction potential was not corrected for. The chlor- 
ide reversal potential was about —85.2 mV. Cells were approached under visual 
guidance using laser-scanning Dodt contrast. After breakthrough, the presence of 
synaptic connections was tested by evoking five spikes at 30 Hz in each cell, 
repeated 30 to 120 times, while searching for postsynaptic responses. 

The paired-pulse ratio (PPR) was calculated as the amplitude of the second 
evoked excitatory postsynaptic potential (EPSP) divided by that of the first one. 
After connectivity mapping, step currents from 250 pA to 700 pA were injected in 
50-pA increments if the input resistance was smaller than around 400 MQ, and, if 
necessary (as indicated by early ceasing of firing due to inactivation of voltage- 
gated sodium channels), currents from —125 pA to 350 pA were injected in 25-pA 
increments for neurons with larger input resistance. Pyramidal neurons were 
identified from several parameters: their morphology in Alexa-594-filled image 
stacks (Fig. 2a); regular-spiking pattern on current injection; spike half-width 
(>1 ms); and, in the presence of connections, depolarizing postsynaptic potentials 
(Fig. 2b). To match the same neurons imaged in vivo and recorded from in vitro, 


we performed three-dimensional image registration of in vivo and in vitro image 
stacks by affine transformation using custom-written MATLAB software sub- 
sequent to the experiment. 

Connection probabilities were calculated as the number of connections detected 
over the number of potential connections assayed. Probabilities of unidirectional 
and bidirectional connections were calculated as the numbers of unidirectionally 
and, respectively, bidirectionally connected pairs divided by the total number of 
pairs. To relate connectivity to functional properties, the asymptotic Cochran- 
Armitage test for trend was used to test for the significance of linear trends”. Pairs 
in which a high-quality recording was achieved in only one cell (for example when 
the other cell was too depolarized or unhealthy, or the seal resistance was less than 
1 GQ) connectivity was assayed only in the direction from the unhealthy cell to the 
healthy cell, given that spikes could be evoked in both cells. Data from these pairs 
were included in the analysis of connection probability, but not in the analysis of 
the probability of finding bidirectional or unidirectional pairs. Only neuronal pairs 
in which both neurons were located at >60-1m depth from the slice surface and 
with an intersoma distance of <50 um were included in the analysis. 

The strength and short-term plasticity of synapses were also measured because 
these synaptic parameters affect the efficacy of presynaptic firing on postsynaptic 
partners. In P13-15 mice, connections between L2/3 pyramidal cells were signifi- 
cantly stronger (median EPSP amplitude: P13-15, 0.41 mV; P22-26, 0.20 mV; 
P=2.9X 10 *, rank-sum test; Supplementary Fig. 4a) and PPR was significantly 
lower (median PPR: P13-15, 0.87; P22-26, 1.13; P= 6.2 X 10+, rank-sum test; 
Supplementary Fig. 4b) than in P22-26 mice, in line with previous reports”. 
However, in neither age group was there significant correlation between EPSP 
amplitude or PPR, on the one hand, and signal correlation or difference in pre- 
ferred orientation, on the other (Supplementary Fig. 4c-f). Part of the connectivity 
data was published previously in ref. 5. 

Neuron model. In the network model, the dynamics of the membrane potential, 
u(t), of model neurons is described by the adaptive exponential integrate-and-fire 
model”: 


d 
cr = —gi(u—E,) +g dre 4" — wig +241 


Here C is the membrane capacitance, gy is the leak conductance, Ey, is the resting 
potential and J is the stimulating current. The exponential term describes the 
activation of sodium current. The parameter 4+ is called the slope factor and 
Vy is the threshold potential. A hyperpolarizing adaptation current is described 
by the variable wa, which has dynamics 


dwad 
T Waa dt. 
where 1,,,, is the time constant of the adaption of the neuron and a is a parameter. 
Upon firing, the variable u is reset to a fixed value, V;esets Whereas Wag is increased 
by an amount b. An additional current, z, which is set to a value I,, immediately 
after a spike occurs and otherwise decays with a time constant t,, such that 
dz 


> =z 


dt 


was used to account for spike afterpotentia 
adaptive threshold, Vy, which starts at Vr 
time constant ty,, such that 


= a(u—E,)— Waa 


I*°. Refractoriness is modelled with the 


after a spike and decays to Vr,,,, witha 


max rest 


dV: 
T= —(Vr— Vite) 


Tr “dt rest 

Parameters for the neuron model were taken from ref. 39, and t, was set to 40 ms in 
agreement with refs 36, 41 and was kept fixed throughout all simulations. We 
made the following choices: C = 281 pF, g: = 30nS, E, = —70.6 mV, 4; =2mV, 
Vojue = 750.4mV, ty,, = 144 ms, a= 4nS, b = 0.0805 pA, I,, = 400 pA, t, = 40 
ms, Ty, = 50 ms, Vr,,,, = —30.4mV. 

Plasticity model. Our plasticity model exhibits separate additive contributions to 
the plasticity rule, one for long-term depression (LTD) and another one for long- 
term potentiation” (LTP). Synaptic weights had hard bounds imposed at Wmin and 
Wmax- For the LTD part, we assumed that presynaptic spike arrival at synapse i 
induces the depression of the synaptic weight w; by an amount 
—Ayrp[u_(t) — 0_],. The brackets [...],. indicate rectification, that is, any value 
u— <0_ does not lead to a change*. The quantity 7_(f) is an exponential low- 
pass-filtered version of the postsynaptic membrane potential, u(t), with time 
constant t_, such that 


da_(t)_ 
ao —u_(t)+u(t) 


Because the presynaptic spike train is described as a series of short pulses at time t/, 
X(t) = 9>,0(t — t!'), where i is the index of the synapse and n is the index that 
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counts the spike, the depression—that is, the change in w; due to LTD (indicated 
by the superscript minus sign)—is given by 
dw; 


dt 


if w; > Wmin Where Ay rp(u) is an amplitude parameter that is under the control of 
homeostatic processes“ and u is the running average of depolarization of the 
postsynaptic neuron, averaged over a time scale of 1s. The time scale of 1s is 
not critical (100s or more would be more realistic for homeostasis), but is con- 
venient for the numerical implementation. 

For the LTP part, we assumed that each presynaptic spike at the synapse w; 
increases the trace, x;(t), of some biophysical quantity, which decays exponentially 
with a time constant t,. (refs 45, 46), such that 


dx;(t) 
E 


= —Arrp(u)X{(t)[u_ —0_],, 


or a — X(t) +Xi(t) 
where X;(f) is the spike train defined above. Potentiation—the change in w; due to 
LTP (indicated by the superscript plus sign)—is given by 


dw;* 
dt 


if W; < Wmax- Here Ayyp is a free amplitude parameter fitted to electrophysiology 
data” and i, (t) is another low-pass-filtered version of u(t), similar to u_ (t) but 
with a shorter time constant, t,.. Thus, positive weight changes can occur if the 
momentary voltage, u(t), surpasses a threshold, 0+, and, at the same time, the 
average value, i, (t), is greater than 0_. 
The final rule used in the simulation was 
dw; 


ae —Ayrp(@)X;(t)[u— —O_] , 


=AyrpX(t)[u—04], [44 —O_], 


+Arprpxi(t)[u—0 4] , [44 —0_] 


+ 


combined with hard bounds Wmin Sw; Wmax. For network simulations, 


Atrp(4)=Arrpu?/u2,, where ur, is a reference value. We made the 
following parameter choices for simulations: 0- = —70.6 mV, 0, = —45.3 mV, 
Aprp =14X10 °mV |, Aprp=8X10°mV 2, t,=15ms, t_=10ms, 
T+ =7ms. 


Network simulation. In all simulations, 500 presynaptic Poisson neurons with 
firing rates v}"* (i = 1,...,500) were connected to 18 postsynaptic excitatory neu- 
rons. The input rates v! followed a Gaussian profile, that is, 
vP™ = A exp(—(i—)/207), with variance ¢ = 10 and amplitude A = 30. The 
centre, 4, of the Gaussian shifted randomly every 100 ms between ten equally 
distributed positions, each position occurring with equal probability. Circular 
boundary conditions were assumed, that is, neuron i = 500 was considered to 
neighbour neuron i = 1. Five inhibitory neurons were each driven by 14 excitatory 
neurons and projected onto 11 excitatory neurons. These connections were chosen 
randomly and were fixed with a weight equal to 1. Feedforward connections onto 
the inhibitory neurons were drawn from a uniform distribution on the interval 
[0, 0.5] and were fixed for the duration of the simulation. The reference value was 
set to u2,, = 70 mV. Parameters for the feedforward connections were chosen as 
for the plasticity model. The excitatory recurrent connections were plastic under 
the same rule and with the same parameters as the feedforward connections, but 
the amplitudes Ay rp and Ayrp were reduced by a factor of 100. 

In the first set of simulations, feedforward weights were initialized with RF 
(weights taken from previous test simulations took values between 0 and 3, which 
were also the hard bounds). At the beginning of the simulation, recurrent excit- 
atory connection weights were drawn randomly from a uniform distribution on 
the interval [0,0.75] (hard bounds were set to 0 and 0.75). At each time point 
during the simulation, noise current (Gaussian white noise) was independently 
injected into each cell in the recurrent network. In this and all subsequent net- 
works, the model was run for 20s of simulated time to allow the homeostatic 
dynamical variables to settle before the recurrent synaptic weights were reinitia- 
lized. The simulations were then run for another 1,000 s. 

To calculate signal correlations in the network at three different time points (at 
the reinitialization of recurrent excitatory chemical synapses, 1s of simulation 
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time later and at the end of the simulation), the weights were frozen and the same 
stimuli were played to assess the firing rate correlations across neurons. Two 
neurons were considered to be bidirectionally connected if both synaptic weights 
between them were >0.6, and were considered to be unidirectionally connected if 
only one of the synaptic weights was >0.6. 

Responsiveness was determined in the following way: the feedforward weights 
onto each neuron were summed, to produce a single value for each cell. These 
values were then plotted ona histogram, which displayed a bimodal distribution. A 
threshold value was chosen between the two peaks of this bimodal distribution, to 
separate the cells into responsive and non-responsive. Feedforward weights were 
manually checked to make sure no cells with clear RFs were missed. Neuronal pairs 
whose RFs hada correlation coefficient of >0.85 were defined to have the same RF. 

The recurrent gap-junction network began such that neurons 1 and 2, 3 and 4, 
and 5 to 7 were electrically coupled together in the stated combinations, and was 
run in this state for 200 s. There were no chemical synapses during this time. The 
gap-junction model was taken from ref. 47. The current from neuron i to neuron j 
was defined as F(t) = a7 j6(t — ti) — geapluj(t) — u(t)], where we chose gga, = 2nS 
and chose « such that the spikelet was about 2 mV. The network was in this state for 
200s of simulation time (after the initial 20-s settling time), at which point gap 
junctions were removed and replaced with recurrent excitatory chemical connec- 
tions with weights drawn randomly from a uniform distribution on the interval 
[0, 0.75]. After this time point, Gaussian white noise current stimulation was again 
provided to each cell in the recurrent network. The network was then run for 
another 800s. 

The comparison chemical network was run in the same manner as the gap- 
junction network except that a recurrent chemical network, with weights drawn 
randomly from a uniform distribution on the interval [0, 0.75], replaced the recur- 
rent gap-junction network during the first 200 s, and there was no replacement of 
the recurrent weights after this 200 s. 
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Glutamine supports pancreatic cancer growth 
through a KRAS-regulated metabolic pathway 
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Cancer cells have metabolic dependencies that distinguish them 
from their normal counterparts’. Among these dependencies is 
an increased use of the amino acid glutamine to fuel anabolic 
processes”. Indeed, the spectrum of glutamine-dependent tumours 
and the mechanisms whereby glutamine supports cancer metabo- 
lism remain areas of active investigation. Here we report the iden- 
tification of a non-canonical pathway of glutamine use in human 
pancreatic ductal adenocarcinoma (PDAC) cells that is required 
for tumour growth. Whereas most cells use glutamate dehydro- 
genase (GLUD1) to convert glutamine-derived glutamate into a- 
ketoglutarate in the mitochondria to fuel the tricarboxylic acid 
cycle, PDAC relies on a distinct pathway in which glutamine- 
derived aspartate is transported into the cytoplasm where it can 
be converted into oxaloacetate by aspartate transaminase (GOT1). 
Subsequently, this oxaloacetate is converted into malate and then 
pyruvate, ostensibly increasing the NADPH/NADP* ratio which 
can potentially maintain the cellular redox state. Importantly, 
PDAC cells are strongly dependent on this series of reactions, as 
glutamine deprivation or genetic inhibition of any enzyme in this 
pathway leads to an increase in reactive oxygen species and a 
reduction in reduced glutathione. Moreover, knockdown of any 
component enzyme in this series of reactions also results in a pro- 
nounced suppression of PDAC growth in vitro and in vivo. 
Furthermore, we establish that the reprogramming of glutamine 
metabolism is mediated by oncogenic KRAS, the signature genetic 
alteration in PDAC, through the transcriptional upregulation and 
repression of key metabolic enzymes in this pathway. The essen- 
tiality of this pathway in PDAC and the fact that it is dispensable in 
normal cells may provide novel therapeutic approaches to treat 
these refractory tumours. 

The prognosis of patients with PDAC remains dismal. The disease is 
extremely aggressive and is profoundly resistant to all forms of therapy’. 
Thus, there is a strong impetus to identify new therapeutic targets for 
this cancer. In recent years, there has been renewed interest in under- 
standing the altered metabolism in cancer, and how such dependencies 
can be targeted for therapeutic gain. However, achieving a successful 
therapeutic index remains a major challenge to the development of 
effective cancer therapies that target metabolic pathways. 

Recent evidence demonstrates that some cancer cells use glutamine 
(Gln) to support anabolic processes that fuel proliferation’. However, 
the importance of Gln metabolism in pancreatic tumour maintenance 
is not known. Thus, we sought to explore the dependence of PDAC on 
Gln, and to examine the functional role of Gln in PDAC metabolism. 
As expected from our previous work’, glucose was required for PDAC 


growth. Additionally, PDAC cells were also profoundly sensitive to 
Gln deprivation, indicating that Gln is also critical for PDAC growth 
(Fig. la and Supplementary Fig. 1). 

Gln provides a carbon source to fuel the tricarboxylic acid (TCA) 
cycle and nitrogen for nucleotide, nonessential amino acid (NEAA) 
and hexosamine biosynthesis**®. To assess the role of Gln metabo- 
lism in PDAC growth, we first impaired glutaminase (GLS) activity 
using RNA interference (RNAi). Notably, GLS knockdown markedly 
reduced PDAC growth (Fig. 1b and Supplementary Fig. 2a, b). 
Consistent with this observation, glutamate (Glu) was able to support 
growth in Gln-free conditions (Supplementary Fig. 2c). 
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Figure 1 | PDAC use a non-canonical glutamine (Gln) metabolism 
pathway. a, PDAC proliferation requires both glucose and Gln. Cells were plated 
in complete media (10 mM glucose and 2 mM Gln) which was replaced the 
following day with glucose- or Gln-free medium supplemented with 10% dialysed 
fetal bovine serum. b, Relative clonogenic growth of 8988T cells expressing a 
control shRNA (shGFP) or two independent shRNAs to GLS1. c, Schematic 
overview of GLUD1- or transaminase-mediated Glu metabolism. d, Relative 
clonogenic growth of 8988T cells. x-KG (4mM), NEAA mixture (0.1 mM 
glycine, alanine, aspartate, asparagine, proline and serine) or the combination was 
added to media following Gln withdrawal. «-KG, u-ketoglutarate; Glc, glucose; 
Gln, glutamine; Glu, glutamate; NEAA, non-essential amino acid. Error bars 
represent s.d. of triplicate wells from a representative experiment. **P < 0.01. 
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Glu can be converted into «-ketoglutarate (a-KG) to replenish the 
TCA cycle through two mechanisms’; either by glutamate dehydro- 
genase (GLUD1) or transaminases (Fig. 1c). Indeed, many cancer cells 
rely on GLUD1-mediated Glu deamination to fuel the TCA cycle’, and 
a-KG has been shown to be an essential metabolite in Gln metabolism’. 
Surprisingly, dimethyl «-KG? did not restore growth upon Gln depri- 
vation (Fig. 1d), whereas the combination of 7-KG and an NEAA 
mixture (the output of transaminase-mediated Glu metabolism) mark- 
edly rescued proliferation in multiple PDAC lines (Fig. 1d and Sup- 
plementary Fig. 2d, e). Together, these data indicate that PDAC cells 
metabolize Gln in a manner that is different from canonical models'® 
and that transaminases may be critical for Gln metabolism in PDAC. 

To confirm the importance of transaminases in PDAC Gln metabo- 
lism, we treated PDAC cells with either aminooxyacetate (AOA), a pan- 
inhibitor of transaminases’, or epigallocatechin gallate (EGCG), an 
inhibitor of GLUD1”. Whereas EGCG had no effect on PDAC growth, 
AOA treatment robustly inhibited the growth of multiple PDAC cell 
lines (Supplementary Fig. 3). Consistent with these results, GLUD1 
knockdown also had no effect on PDAC growth (Fig. 2a). To identify 


the specific transaminase(s) involved in PDAC Gln metabolism, we 
inhibited a panel of Glu-dependent transaminases (aspartate, alanine 
and phosphoserine transaminase) individually using RNAi and exa- 
mined the effect on PDAC growth. Among this panel, knockdown of 
the aspartate transaminase GOT] significantly impaired PDAC growth 
in multiple PDAC cell lines and primary PDAC cells (Fig. 2a and 
Supplementary Fig. 4, 5). 

We next explored the direct effects of GOT1 on Gln metabolism by 
performing targeted metabolomic analysis in GOT1 knockdown PDAC 
cells using uniformly '*C-labelled Gln ({U-'*Cs]GIn) as a tracer*. 
GOTI catalyses the conversion of aspartate (Asp) and «-KG into oxa- 
loacetate (OAA) and Glu in the cytoplasm. Indeed, GOT1 knockdown 
led to increased Gln-derived Asp (and total Asp) and decreased OAA 
(Fig. 2b and Supplementary Fig. 6a). Interestingly, we also observed a 
significant decrease in the ratio of reduced-to-oxidized glutathione 
(GSH:GSSG; Fig. 2b and Supplementary Fig. 6b), suggesting that 
GOT] may play a role in the maintenance of cellular redox homeostasis. 
It should be noted that the changes in metabolite abundance described in 
this experiment are representative of the total cellular metabolite pool 
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Figure 2 | GOT1 is essential for redox balance and growth in PDAC. 

a, Relative clonogenic growth of 8988T cells expressing a control shRNA (shGFP) 
or two independent shRNAs targeting GLUD1, GOT1, GPT2 or PSAT1. Error 
bars represent s.d. of triplicate wells from a representative experiment. b, Relative 
metabolite abundance in 8988T cells grown in [U-°C.]GIn upon GOT1 
knockdown. Data are presented as the total metabolite pool (encompassing both 
metabolite derived from Gln and that not Gln-derived, left) and the 13C_labelled, 
Gln-derived metabolite pool (right). Error bars represent the s.d. of three 
independently prepared samples. c-f, Relative ROS levels in 8988T cells 

under conditions indicated as determined by DCFDA (2',7’- 
dichlorodihydrofluorescein diacetate) staining. DCFDA assay was performed 
24h after supplementing Gln-free media with either OAA (4mM) or dimethyl 
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malate (4 mM). Each bar represents the mean of three independent experiments 
with error bars representing the s.d. g, Schematic depiction of the cytoplasmic 
reactions that convert Asp into pyruvate. h, Relative metabolite abundance in 
8988T cells grown in [U-°Cs]GIn upon ME1 knockdown. i, NADP*/NADPH 
ratio in 8988T cells expressing a control shRNA (shGFP), or an shRNA to G6PD, 
IDH1, GOT] or ME1. Error bars represent s.d. of six replicate wells from a 
representative experiment. j, Flux of the Gln carbon skeleton into downstream 
metabolites as a function of time. The reads for uniformly '*C-labelled 
metabolites, presented in ion current, are plotted for cells expressing the shGFP 
control or shRNA to GOT1. Asp, aspartate; OAA, oxaloacetate; Glu, glutamate; 
GSH, reduced glutathione; GSSH, oxidized glutathione. Error bars represent the 
s.d. of three independently prepared samples. *P < 0.05; **P < 0.01. 
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due to technical limitations associated with organelle-specific metabolite 
isolation. Importantly however, the results we obtained are consistent 
with what one would expect if flux through GOT1 was impaired. 

In PDAC, we recently demonstrated that oncogenic KRAS enhances 
the flux of glycolytic intermediates specifically through the non- 
oxidative arm of the pentose phosphate pathway (PPP) to support 
DNA/RNA biosynthesis without affecting the NADPH-producing 
oxidative arm*. This decoupling of ribose biogenesis from NADPH 
production by KRAS suggests that PDAC cells may rely on an alterna- 
tive mechanism to maintain cellular redox balance. Indeed, inhibition 
of the oxidative PPP in PDAC by knockdown of the rate-limiting 
enzyme glucose-6-phosphate dehydrogenase (G6PD) had minimal 
effect on reactive oxygen species (ROS) levels and, consistent with this, 
glucose deprivation had only a modest impact on ROS (Supplementary 
Fig. 6c). Therefore, we speculated that the GOT1-mediated conversion 
of Gln-derived Asp into OAA functions in a pathway that is used to 
generate the NADPH which could be used to maintain redox balance. 
To test this hypothesis, we assessed ROS levels upon Gln deprivation in 
the absence or presence of OAA. Indeed, Gln deprivation induced ROS, 
and OAA could partially rescue the elevated ROS levels (Fig. 2c). GOT1 
knockdown also increased ROS levels, which again were significantly 
restored upon supplementation with OAA (Fig. 2d). 

Given that Gln-derived malate (and total malate) was significantly 
reduced upon GOT1 knockdown (Fig. 2b), we suspected that Gln- 
derived OAA is metabolized into malate, which is used by malic 
enzyme (ME1) to create NADPH, providing the reducing power to 
maintain reduced glutathione pools". Indeed, malate was able to par- 
tially rescue the oxidative stress imposed by Gln-deprivation (Fig. 2e) 
or GOT1 knockdown (Fig. 2f). These data are consistent with a model 
whereby Gln-derived Asp is converted by GOT1 into OAA, then 
converted into malate by malate dehydrogenase (MDH1) and subse- 
quently oxidized by ME] into pyruvate and reducing power in the 
form of NADPH (Fig. 2g). Consistent with this pathway, metabolomic 
analysis of [U-°C.]Gln tracing in ME] knockdown cells revealed a 
significant increase in Asp, malate and OAA and decreased GSH 
(Fig. 2h and Supplementary Fig. 6d). Furthermore, knockdown of 
GOT1 and MEI markedly increased the cellular NADP*/NADPH 
ratio (Fig. 2i), whereas inhibition of other cytosolic sources of 
NADPH (G6PD or isocitrate dehydrogenase, IDH1) had no effect on 
NADP*/NADPH ratios or ROS (Fig. 2i and Supplementary Fig. 6c). 
Together, the data suggest that PDAC use Gln through the pathway 
depicted in Fig. 2g to increase the NADPH/NADP* ratio for mainten- 
ance of redox homeostasis. Lastly, Gln tracing kinetic flux experiments 
in GOT1 knockdown cells clearly demonstrate decreased flux through 
this pathway (Fig. 2j and Supplementary Fig. 7). Interestingly, lactate in 
the '°C-tracing experiments was minimally labelled, indicating that the 
pyruvate produced by ME] is not used to make lactate by lactate dehy- 
drogenase and has a yet undetermined fate. 

The majority of Asp in PDAC cells (50-75%) is derived from Gln, as 
evidenced by '°C-labelling (Supplementary Fig. 6a, d and 8a). In prin- 
ciple, uniformly '*C-labelled Asp can be derived from Gln following 
either (1) the GLUD1-mediated conversion of Glu to «-KG (and its 
subsequent traversing through the TCA cycle) or (2) the mitochon- 
drial aspartate transaminase (GOT2)-mediated conversion of Glu and 
OAA to o-KG and Asp. Of these two enzymes, only GOT2 knockdown 
significantly affected PDAC growth (Fig. 2a and Supplementary Fig. 
8b). Consistent with this observation, GLUD1 knockdown did not 
affect Asp biosynthesis from Gln, whereas GOT2 knockdown resulted 
in a significant decrease in Gln-derived Asp in PDAC cells (Sup- 
plementary Fig. 8a, c). 

We next tested whether other components of this pathway are also 
necessary to support PDAC growth. Indeed, knockdown of either 
MDHI or MEI markedly inhibited clonogenic survival of PDAC cells 
(Fig. 3a, b and Supplementary Fig. 9) in a manner similar to GOT1 
knockdown. As a further test of enzyme knockdown specificity, we 
investigated the ability of central metabolites in this pathway to rescue 
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PDAC growth upon Gln deprivation. First, we confirmed that exogen- 
ously added metabolites permeated the cell and populated metabolite 
pools (Supplementary Fig. 10). Next, we demonstrated that the com- 
bination of GOT1 substrates, Asp and u-KG, could rescue cell growth in 
Gln-free conditions (Supplementary Fig. 11a). Additionally, OAA per- 
mitted PDAC growth under Gln-free conditions in multiple PDAC cell 
lines (Fig. 3c and Supplementary Fig. 11) as well as upon both GLS (Sup- 
plementary Fig. 12) and GOT1 knockdown (Fig. 3d and Supplementary 
Fig. 13a, b). Lastly, the addition of dimethyl-malate’* was able to parti- 
ally rescue PDAC cell growth upon Gln deprivation (Supplementary 
Fig. 13c) or GOT1 knockdown (Supplementary Fig. 13d). 

Next, to confirm that the OAA or malate rescue of PDAC growth 
upon Gln deprivation was through maintenance of redox homeostasis, 
we treated cells grown in Gln-free conditions with a cell-permeable GSH 
analogue. Remarkably, GSH dramatically rescued clonogenic growth 
following Gln-deprivation (Fig. 3e and Supplementary Fig. 11c) or 
GOT1 knockdown (Fig. 3d and Supplementary Fig. 13a, b). GSH was 
also able to rescue either MDH1 or ME1 knockdown (Supplementary 
Fig. 14a). Near-identical results were also seen with the antioxidant 
N-acetylcysteine (NAC) (Fig. 3e, f and Supplementary Fig. 14b, c). 
Together, these data support the idea that Gln is used by PDAC cells 
to maintain redox homeostasis, which is required to support cellular 
proliferation. 

As further confirmation of the importance of this pathway in 
PDAC, we suppressed GOT1, MDH1 and MEI expression using 
two lentiviral short hairpin RNAs (shRNAs) in PDAC cells and 
assessed their ability to grow as xenografts. Consistent with our in 
vitro results, both GLUD1 shRNAs had no effect on tumour growth 
(Supplementary Fig. 15). In contrast, GOT1, MDH1 and ME1 knock- 
down each robustly diminished tumour growth (Fig. 3g). These data 
provide further support for the critical role of this pathway in Gln 
metabolism and PDAC tumour growth. 
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Figure 3 | Metabolism of the Gln carbon skeleton through GOT1, MDH1 
and ME] supports PDAC growth by maintaining redox balance. 

a, b, Relative clonogenic growth of 8988T cells expressing a control shRNA 
(shGFP) or two independent shRNAs to MDH1 or ME1. ¢, e, Relative 
clonogenic growth of 8988T cells under conditions indicated. Cells were plated 
in complete culture media (10 mM glucose and 2 mM Gln), which was replaced 
the following day with Gln-free medium supplemented with OAA (4mM), 
GSH (4mM) or N-acetylcysteine (NAC) (4mM). d, f, Relative clonogenic 
growth of 8988T cells expressing a control shRNA (shGFP) or two independent 
shRNAs to GOT] with or without OAA (4mM), GSH (4mM) or NAC (4mM). 
Error bars represent the s.d. of triplicate wells from a representative experiment 
(a-f). g, Xenograft growth of 8988T cells expressing a control shRNA (shGFP), 
shRNAs to GOT1 (1 and 2), MDH1 (1 and 2) or MEI (1 and 2) in mice. Error 
bars represent s.e.m. (n = 10). *P < 0.05; **P<0.01. 
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In contrast to PDAC, this pathway seems to be dispensable in nor- 
mal cells. Indeed, treatment of non-transformed human pancreatic 
ductal cells (HPDEs) and human diploid fibroblasts (IMR90) with 
AOA had only modest effects on growth (Supplementary Fig. 16a). 
HPDE cells, unlike PDAC cells, were significantly sensitive to EGCG, 
suggesting a greater reliance on the activity of GLUD1 (Supplementary 
Fig. 16b). Consistent with these results, GOT1 knockdown did not 
impair the growth of HPDE and IMR90 (Supplementary Fig. 16c). 
We also obtained similar results in mouse ductal epithelial cells 
(mPDE) and mouse PDAC cell lines, with the mPDEs being highly 
insensitive to AOA and GOT1 knockdown and two independently 
derived mouse PDAC lines showing significant sensitivity to AOA 
and GOT1 knockdown (Supplementary Fig. 16d, e). Furthermore, 
using an inducible shGOT1 construct, we demonstrated that mouse 
PDAC rely on GOT] to sustain tumour growth in vivo (Supplementary 
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Figure 4 | Oncogenic KRAS mediates Gln reprogramming in PDAC. 

a, Expression of GLUD1 and GOT] was determined by quantitative RT-PCR in 
8988T cells expressing a control shRNA (shGFP) or two independent shRNAs 
targeting KRAS. Western blot confirmed knockdown of KRAS expression. Error 
bars represent the s.d. of three replicates. b, The effect of KRAS knockdown on 
GLUD1 or GOT] protein levels in Pancl cells expressing a doxycycline-inducible 
KRAS shRNA. ¢, Relative clonogenic growth of 8988T cells, expressing a control 
shRNA (shGFP) or shRNA to KRAS, following treatment with AOA or EGCG. 
Error bars represent the s.d. of triplicate wells from a representative experiment. 
d, Relative metabolite abundance in Miapaca2 cells grown in [U-C.]GIn 
following doxycycline-inducible KRAS knockdown. Data are presented as the 
total metabolite pool (encompassing both metabolite derived from Gln and that 
not Gln-derived, left) and the uniformly 13C labelled, Gln-derived metabolite 
pool (right). Error bars represent the s.d. of three independently prepared 
samples. e, Relative cell viability of 8988T cells treated with GLS inhibitors 968 
(active) (10 1M), 365 (inactive analogue) (50 uM) or BPTES (bis-2-(5 
phenylacetamido-1,2,4-thiadiazol-2-yl) ethyl sulfide) (100 nM) with increasing 
concentrations of H,O3. Error bars represent the s.d. of triplicate wells from a 
representative experiment. f, Model depicting the KRAS-regulated Gln metabolic 
reprogramming in PDAC used to maintain redox and support growth. AOA, 
aminooxyacetate; EGCG, epigallocatechin gallate. *P < 0.05; **P <0.01. 
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Fig. 17). Collectively, these data demonstrate that the GOT 1-mediated 
usage of the Gln carbon skeleton is a metabolic adaptation that PDAC, 
and not normal cells, have uniquely acquired to support growth. 

Our previous work demonstrated that anabolic glucose metabolism 
in PDAC is controlled by oncogenic KRAS, which leads to altered 
expression ofa number of rate-limiting metabolic enzymes*. To investi- 
gate the role of KRAS in the reprogramming of Gln metabolism, we 
assessed the expression of GOT1 and GLUD1 upon knockdown of 
KRAS in PDAC cells. Interestingly, KRAS knockdown resulted in a 
marked increase in GLUD1 and a decrease in GOT1 expression at 
the transcriptional level (Fig. 4a), as well as the protein level (Fig. 4b) 
in multiple PDAC lines (Supplementary Fig. 18a). Additionally, using 
five independent orthotopic tumours derived from our inducible KRAS 
PDAC model’, we show that expression of GOT1 increased and 
GLUDI1 decreased in an oncogenic KRAS-dependent manner in vivo 
(Supplementary Fig. 18b). These findings demonstrate that, in PDAC, 
oncogenic KRAS plays a critical role in coordinating the shift in Gln 
metabolism to maintain tumour growth and survival. 

We next assessed the sensitivity of PDAC cells to either AOA or 
EGCG upon KRAS knockdown using 8988T cells, a cell line that is not 
dependent on KRAS for survival'’. Consistent with our previous 
results, AOA significantly inhibited clonogenic growth, whereas 
EGCG had minimal effects. Interestingly, KRAS knockdown made 
the cells significantly more resistant to AOA and sensitive to EGCG 
(Fig. 4c and Supplementary Fig. 18c). To confirm the role of oncogenic 
KRAS in the reprogramming of Gln metabolism, a targeted metabo- 
lomic analysis using [U-°Cs]Gln was performed upon KRAS knock- 
down. Indeed, the changes observed were consistent with KRAS 
supporting the anabolic metabolism of Gln, where multiple metabo- 
lites in the GOT 1-dependent pathway were significantly deregulated 
(Fig. 4d and Supplementary Fig. 19). 

Given the importance of Gln metabolism in maintaining the redox 
state of PDAC, we speculated that inhibition of anabolic Gln metabo- 
lism may sensitize PDAC to oxidative stress. To test this concept, we 
inhibited Gln metabolism in PDAC cells using a GLS inhibitor and 
examined whether this would synergize with hydrogen peroxide 
(H,O2) treatment. First, we found that two chemically distinct GLS 
inhibitors*’” had a growth-suppressive effect on both human and 
mouse PDAC cells (Supplementary Fig. 20a, b), consistent with the 
GLS knockdown data (Fig. 1b). In addition, when combined with 
H,03,, this effect was notably augmented, indicating that PDAC cells 
are markedly more sensitive to ROS when Gln metabolism is impaired 
(Fig. 4e and Supplementary Fig. 20c). This finding may have signifi- 
cant therapeutic implications, given that clinical grade GLS inhibitors 
are being developed* and that standard PDAC therapies (such as 
radiation) lead to the generation of ROS. Moreover, because this aspect 
of Gln metabolism does not seem as critical in normal cells, these data 
indicate an accessible therapeutic window. 

Collectively, our data reveal a novel dependence on the transami- 
nases GOT2 and GOT! for metabolism of the glutamine carbon skel- 
eton in PDAC. These reactions lead to the cytosolic conversion of 
aspartate into OAA, malate and then pyruvate and are required to 
sustain PDAC growth, probably through maintaining redox balance. 
Importantly, our work also demonstrates that oncogenic KRAS med- 
iates this reprogramming of glutamine metabolism (Fig. 4f). Although 
this pathway is critical for redox balance and cell growth in PDAC, it 
does not preclude the involvement of other pathways that may con- 
tribute to redox balance such as glutathione synthesis”’?”° or an NRF2- 
dependent mechanism”’. Furthermore, there may be cell-type-specific 
differences, as KRAS-transformed fibroblasts require both GLUD1 
and transaminases for cell growth’. Lastly, these findings may have 
implications for future therapeutic approaches as inhibition of gluta- 
mine metabolism in PDAC can potentially synergize with therapies 
that increase intracellular reactive oxygen species, such as chemother- 
apy and radiation. 
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METHODS SUMMARY 


Proliferation and clonogenic assays were performed as previously described”’. 
To characterize Gln metabolism, targeted liquid chromatography-tandem mass 
spectrometry was performed’. Briefly, cells were grown in complete media and 
transferred into Gln-free media supplemented with [U-'°C.]Gln overnight 
(steady state) or for the indicated time points (flux analyses). For subcutaneous 
xenografts, PDAC cells infected with lentiviral shRNAs to suppress target gene 
expression were suspended in 100 pl HBSS and injected subcutaneously into the 
lower flank of NCr nude mice. For mouse xenografts, murine PDAC cells stably 
infected with a doxycycline-inducible GOT1 shRNA construct were injected. 
Animals were fed with doxycycline water starting on the day of injection or when 
tumour volume reached ~50mm*. Complete images of western blots are pre- 
sented in Supplementary Fig. 21. 


Full Methods and any associated references are available in the online version of 
the paper. 
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METHODS 


Cell culture. Cell lines were obtained from the American Type Culture Collection 
or the German Collection of Microorganisms and Cell Cultures. All cell lines were 
tested routinely, and before all metabolomic analyses, for mycoplasma contam- 
ination. RPMI-1640, fetal bovine serum and dialysed fetal bovine serum (dFBS) 
were purchased from Invitrogen. Glucose-free DMEM (containing 2mM Gln), 
dimethyl «-KG, Asp, GSH reduced ethyl ester, OAA, dimethyl malate and DMEM 
powder (without glucose and Gln) were obtained from Sigma, and Gln-free RPMI 
1640 was purchased from Cellgro. Cosmic calf serum (CCS) was obtained from 
Thermo Scientific. Cells were cultured in the following media: 8988T, Pancl, 
MPanc96, Miapaca2 and PL45 in DMEM supplemented with 10mM glucose 
and 10% CCS; 8902 in RPMI with 10% CCS; IMR90 in MEM with 10% FBS; 
HPDE cells were cultured as described previously™*. Primary human PDAC lines 
were generated from ascites fluid under IRB approved protocols 02-240 and 
2007P001918. Lines were sequenced and confirmed to have KRAS mutations. 
Cell proliferation assay”*. Cells were plated in 24-well plates at 2,000 cells per well 
in 0.5 ml of media. To deprive Gln, cells were plated in complete culture media 
(10mM glucose and 2mM Gln), which was exchanged with Gln-free medium 
supplemented with 10% dFBS the following day. Media was not changed through- 
out the course of the experiment. At the indicated time points, cells were fixed in 
10% formalin and stained with 0.1% crystal violet. Dye was extracted with 10% 
acetic acid and the relative proliferation was determined by attenuance (D) at 
595 nm. 

Clonogenic assay”. Cells were plated in 6-well plates at 300 cells per well in 2 ml of 
media. Media was not changed throughout the course of the experiment. After 
7-10 days, colonies were fixed in 80% methanol and stained with 0.2% crystal 
violet. 

Quantitative RT-PCR. Total RNA was extracted using TRIzol (Invitrogen) and 
reverse transcription was performed from 2 1g of total RNA using oligo-dT and 
MMLV HP reverse transcriptase (Epicentre), according to the manufacturer’s 
instructions. Quantitative RT-PCR was performed with SYBR Green dye using 
an Mx3000PTM instrument (Stratagene). PCR reactions were performed in trip- 
licate and the relative amount of cDNA was calculated by the comparative Cr 
method using the 18S ribosomal RNA sequences as a control. See Supplementary 
Information for primer sequences. 

Xenograft studies. For subcutaneous xenografts, 8988T cells were infected with 
lentiviral shRNAs targeting GLUD1 (n = 2), GOT1 (n = 2), MDHI1 (n = 2), ME1 
(n = 2) and GFP (control hairpin, n = 1) and subjected to a short puromycin 
selection (2 4g ml‘); shRNA sequences are presented below. 1.5 X 10° cells, sus- 
pended in 100 pl Hanks Buffered Saline Solution (HBSS), were injected subcuta- 
neously into the lower flank of NCr nude mice (Taconic). Tumour length and 
width were measured twice weekly and the volume was calculated according to the 
formula (length  width”)/2. All xenograft experiments with human PDAC lines 
were approved by the HMS Institutional Animal Care and Use Committee 
(IACUC) under protocol number 04-605. For mouse xenografts, a doxycycline- 
inducible GOT1 shRNA construct was generated by annealing oligonucleotides 
that form an shRNA sequence to mouse GOT1, shRNA (forward: CCGGCCA 
CATGAGAAGACGTTTCTTCTCGAGAAGAAACGTCTTCTCATGTGGTTT- 
TTG; reverse: AATTCAAAAACCACATGAGAAGACGTTTCTTCTCGAGA 
AGAAACGTCTTCTCATGTGG). The annealed product was ligated into the 
Agel-EcoRI sites of the pLKO-Tet-on vector. For subcutaneous xenograft, 10° 
stably infected murine PDAC cells were suspended in 100 ul Hanks Buffered 
Saline Solution and injected subcutaneously into the lower flank of NCr nude mice 
(Taconic). Animals were fed with doxycycline water (doxycycline 2 g1~', sucrose 
201‘) starting on the day of injection or when tumour diameter reached 50 mm. 
Tumour volumes were measured every third day starting from day 4 post-injection 
and calculated as above. These xenograft experiments were approved under 
MDACC IACUC protocol 111113931. 

Western blot analysis. After SDS-PAGE, proteins were transferred to Hybond-N 
Nitrocellulose (Amersham Biosciences). Membranes were blocked in Tris-buffered 
saline (TBS) containing 5% non-fat dry milk and 0.1% Tween 20 (TBS-T), before 
incubation with the primary antibody overnight at 4°C. The membranes were 
then washed with TBS-T followed by exposure to the appropriate horseradish 
peroxidase-conjugated secondary antibody for 1h and visualized on Kodak X-ray 
film using the enhanced chemiluminescence (ECL) detection system (Thermo 
Scientific). The following antibodies were used: KRAS (F234, Santa Cruz), GOT1 
(NBP1-54778, Novus), GLUD1 (ab55061, Abcam) and B-actin (A2066, Sigma). 


ROS quantification. DCFDA assay was performed 24 h after supplementing Gln- 
free media with either OAA (4mM) or dimethyl malate (4mM). Cells were 
incubated with 5uM_ 2',7’-dichlorodihydrofluorescein diacetate (DCFDA, 
Invitrogen) for 30 min. Excess DCFDA was removed by washing the cells twice 
with PBS, and labelled cells were then trypsinized, rinsed, and resuspended in PBS. 
Oxidation of DCFDA to the highly fluorescent 2’,7’-dichloro-fluorescein (DCF) is 
proportionate to ROS generation and was analysed by flow cytometry. 
Metabolomics. For steady state metabolomic analysis, PDAC cell lines were 
grown to ~50% confluence in growth media (DMEM, 2 mM Gln, 10 mM glucose, 
10% CCS) on 10-cm dishes in biological quadruplicate. A complete media change 
was performed 2 h before metabolite collection. To trace Gln metabolism, PDAC 
cell lines were grown as above and then transferred into Gln-free DMEM (with 
10 mM glucose) containing 10% dialysed FBS and 2 mM [U-'°Cs]Gln (Cambridge 
Isotope Labs) overnight (for steady state labelling) or for the indicated time points 
in the flux analyses. Additionally, fresh media containing [U-'?C;]Gln was 
exchanged 2 h before metabolite extraction for steady state analyses. The quantity 
of the metabolite fraction analysed was adjusted to the corresponding protein 
concentration calculated upon processing a parallel 10-cm dish. Metabolite frac- 
tions were collected and analysed by targeted LC-MS/MS via selected reaction 
monitoring (SRM), as described". Processed data was analysed in Cluster 3.0 and 
TreeViewer. 

Measurement of sensitization of PDAC cells to ROS. PDAC cell lines were 
plated into 96-well plates at 10° cells per well in 200 ul of growth media. The 
following day, growth media was replaced with that containing GLS inhibitors 
and/or H,O,. Parallel plates were analysed at 3, 6 and 9 days by Cell Titer Glo 
analysis (Promega), per the manufacturer’s instruction. The GLS inhibitors 968 
(active) and 365 (structurally similar, inactive) were provided as a gift from the 
Cerione laboratory’. BPTES was a gift from Jaime Escobedo (Forma Therapeutics). 
Lentiviral-mediated shRNA targets. All shRNA vectors were obtained from the 
RNA Interference Screening Facility of Dana Farber Cancer Institute. The 
sequences and RNAi Consortium clone IDs for each shRNA are as follows. 
shGLS-1, GCACAGACATGGTTGGTATAT (TRCN0000051135); shGLS-2, 
GCCCTGAAGCAGTTCGAAATA (TRCN0000051136); shMDH1-1, CCCTGT 
TGTAATCAAGAATAA (TRCN0000221892); shMDH1-2, GCAACAGATAA 
AGAAGACGTT (TRCN0000221893); shME1-1, GCCTTCAATGAACGGCC 
TATT (TRCN0000064728); shME1-2, CCAACAATATAGTTTGGTGTT 
(TRCN0000064729); shGLUD1-1, CCCAAGAACTATACTGATAAT (TRCN 
0000220878); shGLUD1-2, GCAGAGTTCCAAGACAGGATA (TRCN000022 
0880); shGOT1-1, GCGTTGGTACAATGGAACAAA (TRCN0000034784); 
shGOT1-2, GCTAATGACAATAGCCTAAAT (TRCN0000034785); shGPT2-1, 
CGGCATTTCTACGATCCTGAA (TRCN0000035024); shGPT2-2, CCATCAA 
ATGGCTCCAGACAT (TRCN0000035025); shPSAT1-1, GCCAAGAAGTTT 
GGGACTATA (TRCN0000035264); shPSAT1-2, CCAGACAACTATAAGGT 
GATT (TRCNO0000035265); shKRAS-1, CCTCGTTTCTACACAGAGAAA 
(TRCN0000040148); shKRAS-2, GAGGGCTTTCTTTGTGTATTT (TRCN0000 
033260). 

Reagents. NADP*/NADPH ratios were determined using the NADP/NADPH 
assay kit (Abcam; ab65349) according to the manufacturer’s instructions. Briefly, 
10° cells (n = 6 wells of a 6-well dish) were collected on ice in extraction buffer and 
subject to two rounds of freeze-thaw at — 80 °C. NADP* and NADPH values were 
determined in biological sextuplet and concentration was obtained by comparison 
to standard curves. OAA was not analysed by targeted LC/MS-MS due to its 
limited stability in aqueous solvents at room temperature”. As such, the abund- 
ance of this metabolite was determined using a quantification kit (Biovision), 
according the manufacturer’s instruction. Briefly, 2 x 10° cells (n= 4, 10-cm 
dishes) were collected during log-phase growth by trypsinization, resuspended 
immediately in the buffers provided (on ice), analysed and compared to standard 
curves. The signals obtained were normalized to the protein concentration calcu- 
lated upon processing a parallel 10-cm dish. NEAA mixture consisted of 0.1 mM 
glycine, alanine, aspartate, asparagine, proline and serine. 


24. Ouyang, H. et al. Immortal human pancreatic duct epithelial cell lines 
with near normal genotype and phenotype. Am. J. Pathol. 157, 1623-1631 
(2000). 

25. Bajad, S. U. et al. Separation and quantitation of water soluble cellular 
metabolites by hydrophilic interaction chromatography-tandem mass 
spectrometry. J. Chromatogr. A 1125, 76-88 (2006). 
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Protein N-myristoylation is a 14-carbon fatty-acid modification 
that is conserved across eukaryotic species and occurs on nearly 
1% of the cellular proteome’. The ability of the myristoyl group to 
facilitate dynamic protein-protein and protein-membrane inter- 
actions (known as the myristoyl switch) makes it an essential 
feature of many signal transduction systems*. Thus pathogenic 
strategies that facilitate protein demyristoylation would markedly 
alter the signalling landscape of infected host cells. Here we describe 
an irreversible mechanism of protein demyristoylation catalysed 
by invasion plasmid antigen J (IpaJ), a previously uncharacterized 
Shigella flexneri type III effector protein with cysteine protease 
activity. A yeast genetic screen for IpaJ substrates identified ADP- 
ribosylation factor (ARF)1p and ARF2p, small molecular mass 
GTPases that regulate cargo transport through the Golgi apparatus’. 
Mass spectrometry showed that IpaJ cleaved the peptide bond 
between N-myristoylated glycine-2 and asparagine-3 of human 
ARF1, thereby providing a new mechanism for host secretory 
inhibition by a bacterial pathogen**. We further demonstrate that 
IpaJ cleaves an array of N-myristoylated proteins involved in cel- 
lular growth, signal transduction, autophagasome maturation and 
organelle function. Taken together, these findings show a previ- 
ously unrecognized pathogenic mechanism for the site-specific 
elimination of N-myristoyl protein modification. 

While seeking to understand how bacterial pathogens regulate the 
host secretory pathway’, we found that S. flexneri potently inhibited 
cargo transport through the Golgi apparatus (Supplementary Fig. 1). 
Further investigation showed that the Golgi was severely fragmented 
after Shigella infection (Fig. 1a). By contrast, neither Listeria mono- 
cytogenes nor Salmonella typhimurium disrupted the Golgi, suggesting 
that Shigella has evolved a specific, yet poorly understood, mechanism 
to regulate host organelle function. 

Shigella infection requires translocation of more than 20 bacterial 
‘effector’ proteins into host cells through the Mxi-SPA Type III Sec- 
retion System (T3SS)*. Deletion of mxiD, a component of the needle 
complex required for T3SS, eliminated Golgi destruction by Shigella 
infection (Fig. 1d). To identify the specific effector protein required for 
this activity, Golgi morphology was assessed after transient transfec- 
tion of 20 effector genes (Supplementary Fig. 2a, b). Both IpaJ and 
VirA induced profound Golgi fragmentation (Fig. 1b) and inhibited 
hormone trafficking through the secretory pathway (Supplementary 
Fig. 2c). However, the other T3SS effector proteins had no discernible 
effect on Golgi structure or function (Supplementary Fig. 2a). 

Comparison of cells infected with Shigella AipaJ and AvirA gene 
deletion strains showed that these mutants induced abnormal Golgi 
morphologies (Fig. 1c) and variable degrees of Golgi disruption (Fig. 1d), 
yet neither mutant alone fully attenuated Golgi destruction. By con- 
trast, the Golgi remained intact and functional in cells infected with 
AipaJ/virA double mutant strain (Fig. 1c, d). The Shigella AipaJ/virA 
strain showed normal host cell invasion, intracellular replication and 


actin-based motility, suggesting Golgi disruption was specifically 
caused by these effector genes (Supplementary Fig. 3). Next, in vivo 
bacterial replication was evaluated using an established mouse model 
of mucosal infection’. The number of recoverable bacteria was sharply 
reduced (by 100-fold, P< 0.001) by deleting either ipaj or virA genes 
compared with wild-type Shigella (Fig. le). As expected, bacterial rep- 
lication was further attenuated in the Shigella AmixD mutant lacking 
all T3SS function (Fig. le). We also found reductions in host inflam- 
matory cytokines in mice infected with each Shigella mutant strain 
(data not shown). Taken together, these data demonstrate that two 
Shigella effector proteins, IpaJ and VirA, each harbour Golgi inhibitory 
activity, and are essential and play non-redundant roles for optimal 
in vivo virulence. 

Although VirA was recently shown to inactivate Rabl1 GTPase sig- 
nalling pathways’, extra mechanisms are necessary for Golgi destruc- 
tion during Shigella infection®. We therefore focused on IpaJ because 
its molecular mechanism of action is poorly understood'®"’. BLAST 
database searches identified IpaJ homologues in numerous bacterial 
species, but this sequence-based alignment did not offer any clues to its 
function (Supplementary Fig. 4). In contrast, structural-based bioin- 
formatics showed that IpaJ is distantly related to the cysteine peptidase 
C39-like family of domains of unknown function (annotated DUF3335) 
(Fig. 2a). C39-family members cleave leader peptides from bacteriocins 
with anti-microbial and quorum-sensing activities'’*. Although IpaJ is 
unlikely to function in these capacities, it harbours the catalytic cysteine 
(C), histidine (H) and aspartate (D) residues required for peptide bond 
hydrolysis (Fig. 2a). As predicted by this alignment, alanine substi- 
tutions at C64, H206 or D218 abolished the ability of IpaJ to disrupt 
host Golgi morphology in transfection studies (Fig. 2b). In addition, 
complementation of Shigella AipaJ/virA deletion strain with plasmid 
expression of IpaJ induced Golgi disruption phenotype whereas IpaJ 
C64A, H206A and D218A had no effect (Supplementary Fig. 5). 

Previous studies have shown that IpaJ induces growth arrest pheno- 
type in yeast’’. A similar growth inhibitory activity was found with IpaJ, 
whereas yeast grew normally in the presence of IpaJ C64A, H206A 
or D218A catalytic mutants (Fig. 2c). Using a multicopy suppressor 
screen, we identified three distinct genomic loci that, when intro- 
duced on high-copy plasmids, suppressed IpaJ activity in yeast. Two 
loci encoded ARF1p and ARF2p GTPases, and the third loci encoded 
VPS15p, a phosphatidylinositol kinase required for yeast vacuole 
fusion” (Fig. 2d). Although these data suggest that IpaJ may regulate 
several host signalling pathways, we focused our initial efforts on ARF 
GTPases because these enzymes are master regulators of cargo traf- 
ficking through the Golgi apparatus*. Overexpression of either ARF1p 
or ARF2p rescued the yeast growth arrest phenotype, thereby defining 
ARF GTPases as potential substrates of IpaJ (Fig. 2e). 

ARF1 functionally couples guanine-nucleotide exchange (GDP for 
GTP) and membrane binding (through N-myristoylation) to the 
recruitment of COPI coat-complex to Golgi membranes as a key step 
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Figure 1 | ShigellaIpaJ and VirA disrupt Golgi morphology. a, Fluorescence 
microscopy HeLa cells infected with Shigella and Listeria visualized with 
4',6-diamidino-2-phenylindole (DAPI) (pseudo-coloured red), and Salmonella 
carrying mCherry-expressing vector (red). The cis-Golgi (green) was detected 
by «%-GM130 antibodies. Scale bar, 10 jim. b, Fluorescence microscopy of 
HeLa cells transfected with either IpaJ or VirA. The cis-Golgi (GM130, green) 
and F-actin (red) are shown. Scale bar, 10 um. c, Fluorescence microscopy 

of HeLa cells infected with indicated Shigella mutant strains. Scale bar, 

10 jum. d, Percentage of HeLa cells with disrupted Golgi morphology in 100 
cells infected with either wild-type Shigella (M90T) or the indicated mutants. 
Error bars, means + s.e.m. calculated from three independent experiments. 

e, Number of recoverable c.f-u. per mouse lung 24 h after intranasal inoculation 
with 1 X 10° of wild-type M90T or each mutant Shigella strain as indicated. 
Limit of detection (LOD) for this assay is indicated by the dotted line. Error 
bars, geometrical means + s.e.m. **P < 0.001. 


in vesicle fission*. Surprisingly, IpaJ had no effect on the guanine- 
nucleotide cycle or GTPase-dependent interactions of ARF1 in vitro 
(Supplementary Fig. 6). We therefore searched for IpaJ-induced 
post-translational modifications on ARF1 by ‘top-down’ mass spec- 
trometry'*’®. ARF1 bearing a carboxy (C)-terminal Strep affinity tag 
(ARF 1-strep) was expressed in HEK293T cells and purified by Strep- 
Tactin chromatography. We observed an intact monoisotopic molecu- 
lar mass of 26,895.95 Da and subsequent tandem mass spectrometry 
(MS/MS) resulted in 30 fragment ions that, together, provided an un- 
ambiguous assignment of N-myristoylated ARF1-strep protein (Fig. 3a 
and Supplementary Fig. 7a). Cellular expression of IpaJ resulted in a 
266.35 Da decrease in molecular mass of ARF1-strep, which corre- 
sponds closely to the mass of the N-myristoyl group and the amino 
(N)-terminal glycine (267.22 Da) (Fig. 3b). Indeed, the MS/MS data 
support the idea that IpaJ cleaved the amide bond between glycine-2 
and asparagine-3, liberating the N-myristoylated glycine from the ARF1 
GTPase domain (Fig. 3d and Supplementary Fig. 7b). ARF1 cleavage 
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Figure 2 | IpaJ belongs to the C39-like family of cysteine proteases and 
targets ARF-family GTPases. a, Sequence-structure alignment of S. flexneri 
IpaJ with members of C39-peptidase-like family identified by HHPred. IpaJ 
possesses invariant catalytic triad residues Cys 64, His 206 and Asp 218 (red). 
The conserved Gln 58, which helps form the oxyanion hole found in many 
proteases, is shown in green. Invariant residues and core hydrophobic residues 
found in the C39-family are shown in blue and magenta, respectively. 
Characterized C39-family member (GCN5-acyetyl transferase) and a 
C1-family member (Papain) are shown. b, Fluorescence microscopy of 

HeLa cells transfected with the indicated IpaJ mutants. The cis-Golgi (GM130, 
green) and F-actin (red) are shown. Scale bar, 10 um. ¢, IpaJ or its catalytic 
mutants were expressed from a galactose-inducible promoter (pGal413 vector) 
and assayed for a growth arrest phenotype on galactose or glucose (control) 
carbon source. d, Illustration of the yeast genomic clones isolated from the IpaJ 
suppressor screen. ARFlp, ARF2p and VPS15p are highlighted. e, Yeast strain 
harbouring a galactose-inducible IpaJ gene were transformed with a multi-copy 
vector containing the indicated genes. Yeast were assayed for a growth arrest 
phenotype on galactose or glucose (control) carbon source. 


was not observed in IpaJ C64A-treated samples, consistent with the 
function of IpaJ as cysteine protease (Fig. 3c and Supplementary Fig. 7c). 
Next, we devised a protocol to reconstitute the ARF1 cleavage reac- 
tion in vitro (Fig. 3e). Because Escherichia coli does not express a 
system for N-myristoylation, ARF1-His was co-expressed with yeast 
N-myristoyl transferase (NMTp) in the presence of exogenous azide- 
conjugated myristic acid. The lipid state of purified ARF1-His was 
visualized by fluorescently labelling the azido-myristoyl group by 
click chemistry’” (Fig. 3f, lanes 1 and 2). Importantly, IpaJ cleaved 
N-myristoylated ARF1 in vitro whereas this reaction was less efficient 
with IpaJ C64A (Fig. 3f, lane 3 and 4). Similar results were also 
observed when IpaJ was incubated with ARF1-strep isolated from 
mammalian cellular lysates following a similar procedure (Fig. 3g). 
Having established that IpaJ cleaves ARF1, we revisited our studies 
in yeast showing that IpaJ may target several host substrates including 
VPS15p, a lipidated kinase’. Although ARF1 and VPS15p are both 
N-myristoylated on their glycine-2 residue, they do not share any 
sequence or functional similarity. These observations suggested that 
IpaJ might cleave a broader range of substrates than initially suspected. 
To test this hypothesis, we labelled numerous of N-myristoylated pro- 
teins found in HEK293T cells and found that a large proportion of 
these proteins were demyristoylated in IpaJ-treated samples (Fig. 3h, 
arrows). The spectrum of N-myristoylated proteins was unaltered 
by IpaJ C64A mutant, thereby verifying the proteolytic-basis of this 
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Figure 3 | IpaJ cleaves the N-myristoylated glycine of lipidated substrates. 
a-c, Mass spectra of purified ARF1-strep in untreated (a), IpaJ-treated (b) or 
IpaJ C64A-treated (c) cells, with the observed monoisotopic mass reported in 
daltons. *Observed 1 Da shift in molecular mass is accounted for in MS/MS 
data (Supplementary Fig. 7). d, Structure of N-myristoylated ARF1 N terminus 
in untreated (left) and IpaJ-treated (right) samples. R group following Asn 3 
denotes ARF1-strep protein residues 4-240. e, Reconstitution of ARF1-His 
N-myristoylation (N-myr) by NMTp in bacterial cells and IpaJ cleavage 
reaction in vitro (see Supplementary Information and Methods). f, In-gel 
fluorescence assay (top panel) visualizing Alexa Fluor 647-labelled 
myristoylated ARF1-His isolated from bacteria either not expressing (lane 1) or 
expressing (lane 2) NMTp. Bacterial cell lysates treated with recombinant IpaJ 
or IpaJ C64A as indicated. The expression levels of ARF1-His were determined 
by Coomassie blue stain. g, In-gel fluorescence assay (top panel) visualizing 
Alexa Fluor 647-labelled myristoylated ARF1-strep purified from HeLa cell 
lysates left untreated or incubated with IpaJ or its catalytic mutant as indicated. 
The expressed amounts of ARF1-strep are shown (bottom panel). h, In-gel 
fluorescence assay visualizing protein extracts isolated from HeLa cells 
incubated with azide myristic acid, azide palmitic acid or geranylgeranyl alcohol 
azide and subsequently labelled with Alexa Fluor 647 alkyne by click chemistry. 
Arrows indicate proteins that are proteolytically demyristoylated by Ipaj. 
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reaction. IpaJ had no effect on either palmitoylation or geranylgera- 
nylation, indicating that the protease specifically cleaves proteins 
modified by the myristoyl group (Fig. 3h). 

N-myristoylated proteins share a conserved glycine but their 
sequences are otherwise highly variable (Fig. 4a). To determine how 
IpaJ recognizes diverse substrates, we first generated chimaeric enhanced 
green fluorescent protein (EGFP) fusion proteins that expressed 10 
amino-acid leader peptides derived from c-Src, MARCKs, hVPS15 
and GRASP65, N-myristoylated proteins involved in cellular growth, 
signal transduction, autophagasome maturation and organelle func- 
tion, respectively (Fig. 4a)'**. IpaJ cleaved each of these lipid-modified 
peptides in a reaction requiring its cysteine protease activity (Fig. 4b). 
The ability of IpaJ to cleave unrelated peptides suggested that the 
protease might recognize the lipidated glycine-2 common to all of its 
substrates. We found that soluble myristic acid was unable to inhibit 
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ARF! proteolysis, suggesting that IpaJ does not simply recognize long 
hydrocarbon chains (Supplementary Fig. 8). However, protein lipida- 
tion was essential for proteolysis because IpaJ was unable to cleave 
non-myristoylated ARF1 in vitro (Supplementary Fig. 9). When taken 
together, these data indicate that IpaJ specifically recognizes the 
conserved glycine-2 of substrates in context of its amide linkage to 
N-myristic acid. 

N-myristoylation is unique among lipid modifications in that it 
undergoes a dynamic conformational switch in response to upstream 
signalling events (termed the ‘myristoyl switch’)’. For example, the 
myristoyl group is sequestered in GDP-inactive ARF1 and liberated 
for membrane binding and signalling by GIP exchange (Supplemen- 
tary Fig. 10)**?**. We found that IpaJ was unable to cleave ARF1 T31N 
(a GDP-bound mutant form), but efficiently cleaved ARF1 Q71L (a 
mutant locked it the GTP-bound active state) in cells (Fig. 4c). These 
findings suggest IpaJ recognizes substrates as they cycle between acti- 
vity-dependent conformational states. Consistent with this interpreta- 
tion, microinjection of purified IpaJ induced the release of activated 
ARF1 from Golgi membranes just before organelle fragmentation 
(Fig. 4d). ARF1 was also displaced from Golgi membranes by wild- 
type Shigella infection but not by AipaJ single mutant (note that the 


a b sohves15-GFP c-Src-GFP 
IpaJ cleavage site ‘a roa 
S$ $ S) 
& PP & 2 @ 
Hs ARF1  2GNIFANLFK-10 “wee VS 
Sc ARFlp 2-GLFASKLFS-10 hVps15 c-Src 
Hs hVPS15 2-(GNQLAGIAP-10 eR N-my) Ph -myn) 
Sc VPS15p 2-GAQLSLVVQ-10 = 22] "Vests c-Sre 
Hs c-Src 2-GSNKSKPKD-10 bested (input)? FRY input) 
Hs MARCKS 2-GAQFSKTAA-10 pad Ipad 
Hs GRASP65 2-GLGVSAEQP-1@ — = | input) SF input) 
© arriT3iN ARF1 Q71L MARCKS-GFP GRASP65-GFP 
‘a no ne 2 ye 
re SF ‘ & oy S& wo of 
oe 
WP Owe FP LPL MSs 
ARF1 ARF1 -| MARCKS GRASP65 
(N-myr) (N-myr) = =| (v-myr) Ce N-myr) 
ARF1 ARF1 MARCKS _— GRASP65 
(input) (input) ae | input) —_ FF (input) 
Ipad IpaJ Ipad ae | PAY 
(input) (input) (input) (input) 


e ARF1-GFP GM130 


Shigella 
Figure 4 | Conformational-dependent cleavage of lipidated substrates by 
IpaJ. a, Multiple-sequence alignment of N-myristoylated glycine and 
downstream sequences from the indicated proteins (Hs, Homo sapiens; Sc, 
S. cerevisiae). b, In-gel fluorescence assay showing myristoylated peptides of the 
indicated proteins in cells expressing IpaJ or catalytic mutant. The peptide- 
eGFP and IpaJ inputs are indicated. *Demyristoylation of GRASP65 and 
hVPS15 resulted in slower mobility of the resulting peptide, potentially owing 
to reduced mobility in SDS—polyacrylamide gel electrophoresis caused by 
proteolytic reaction. c, In-gel fluorescence assay showing myristoylated ARF1 
T31N mutant (a GDP-locked mutant) or ARF1 Q71L (a GTP-locked mutant) 
after in cells co-expressing IpaJ or catalytic mutants as indicated. Equal loading 
of ARF1 (middle panel) and IpaJ (lower panel) was determined by western blot 
analysis. d, Fluorescence microscopy showing ARFl-mCherry and the Golgi 
apparatus (GM130) after cellular microinjection of recombinant IpaJ or IpaJ 
C64A, over the indicated period. Scale bar, 10 um. e, Fluorescence microscopy 
showing ARF1-eGFP (green) and Golgi membranes (blue, GM130) after 


infection with Shigella M90T and the indicated mutants (red; mCherry- 
expressing bacteria). Scale bar, 101m. 
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Golgi is fragmented by VirA in this infection) or AipaJ/AvirA double 
mutant strains (Fig. 4e). Extra cell-based experiments support the 
notion that GTP-active ARF1 is a physiological target of IpaJ during 
Shigella infection (Supplementary Figs 11-13), thereby defining a new 
mechanism for inhibition of cytokine secretion and immune detection 
by a bacterial pathogen. 

Here we have determined that IpaJ is a cysteine protease that cleaves 
the N-myristoylated glycine from ARF1 and potentially numerous 
host cellular substrates. This discovery adds significantly to the small 
but increasing number of bacterial proteases that cleave lipidated pro- 
teins as a mechanism of microbial pathogenesis**”°. Importantly, the 
ability of IpaJ to eliminate the N-myristoyl modification of unrelated 
proteins may allow Shigella to exploit signalling pathways in a variety 
of host cellular contexts’’. Although extra studies will be needed to 
unravel the hierarchy of these events, our discoveries provide a unique 
framework from which to explore the functional consequences of 
protein N-myristoylation in human health and disease. 


METHODS SUMMARY 


All bacterial effector genes were amplified by PCR from S. flexneri M90T genomic 
DNA and cloned into the relevant expression vectors. S. flexneri M90T knockout 
strains were generated using a A red recombinase-mediated recombination sys- 
tem**. For the mucosal model of infection, S. flexneri strains were administered 
dropwise (10° colony-forming units (c.f.u.)) into the external nares of mice, and 
bacterial c.fu. were enumerated by plating serial dilutions of lung homogenates as 
described previously’. For structural bioinformatics, HHpred was used to detect 
known Pfam domains with distant structural homology relationships, and 
PROMALS3D” was used to produce a multiple-sequence alignment to C39-like 
family members. We performed the yeast suppressor screen by integrating a 
single copy of the ipaJ under the Gall promoter into Y7092 yeast strain genome. 
A Saccharomyces cerevisiae genomic library (Yep13 vector, American Type 
Culture Collection) was transformed into yeast and positive library clones were 
mapped by DNA sequencing. To identify post-translational modifications of IpaJ, 
liquid chromatography (LC)-MS/MS was performed as previously described”. To 
detect N-myristoylation of proteins, cells were metabolically labelled with myristic 
acid-azide’’, and purified N-myristoylated proteins were labelled with Alexa Fluor 
647 Alkyne using Click-iT Reaction Buffer (Invitrogen). 


Full Methods and any associated references are available in the online version of 
the paper. 
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METHODS 


Plasmids and cloning. Bacterial effector genes were amplified by PCR from S. 
flexneri M90T and cloned into pEntr/D, generating a Gateway'™ compatible entry 
clone (Invitrogen) according to the manufacturer’s recommendations. Twenty S. 
flexneri effectors were cloned into pEntr/D vector: IpaA, AAK18443.1; IpaB, 
AAK18446; IpaH1.4, AAK18594.1; IpaH2.5, AAK18367; IpaH4.5, AAK18395; 
IpaH7.8, AAK18394.1; IpaH9.8, AAK18544; IpaJ, AAK18440; IpgD, AAK18452; 
IcsB, AAK18450; OspB, AAL72323.1; OspC1, AAL72322.1; OspC2, AAW64906; 
OspD1, AAW64782; OspE1, AAW64916; OspE2, AAW64805; OspF, AAW64770; 
OspG, NP_085391; VirA, AAK18501. For mammalian expression, the effectors 
were then recombined into modified pcDNA3.1 vector carrying N-terminal eGFP 
or mCherry in frame with the Gateway cassette (Gateway™ vector conversion 
system, Invitrogen). For bacterial expression, a 50-amino-acid N-terminal deletion 
of IpaJ (IpaJA50; residues 51-259) was cloned in frame into pGEX-4T1 (GST-tag) 
(Amersham), yielding intact catalytic domain of IpaJ C-terminally fused to GST- 
tag. N-terminal deletion of ARF1 (ARF1A17; residues 18-181) was subcloned by 
PCR into pGEX-4T1 vector. Human GGA1 (GAT domain, residues 76-215) was 
amplified and cloned by PCR in frame into pET28b-MBP-His vector. For com- 
plementation of S. flexneri knockout strains, ipaJ was amplified and cloned by PCR 
into pBad/Myc-His vector to be expressed under arabinose-inducible promoter. 
mCherry was expressed from the rpsD promoter in pDP151 vector in Shigella 
strains. For galactose-induced yeast expression, ipaJ was recombined from pEntr/ 
D into Gateway’ ™-compatible pYes-Dest52 vector (Invitrogen) for expression 
under galactose-inducible promoter. Yeast arfl and arf2 (SGD: S000002351, 
SGD: S000002296) were amplified by PCR from Yep13 genomic clones and cloned 
into p415 vector for yeast expression from a constitutively active promoter 
(GAPDH). Human full-length arf1 was cloned in frame into modified pcDNA3.1 
vector carrying C-terminal mCherry or eGFP. Human GRASP65 was cloned in 
frame into peGFP-N2 vector. For expression of Strep-tagged full-length human 
ARF1 in mammalian cells, the arf1 complementary DNA (cDNA) (Missouri S&T 
cDNA Resource Center; ARF0100000) was first cloned into pEntr/D entry vector. 
The gene was then recombined into modified pcDNA4T/O vector carrying 
Gateway'™ cassette and two repeats of C-terminal Strep-tag (Trp-Ser-His-Pro- 
Gln-Phe-Glu-Lys, IBA GmbH). For expression of leader peptides bearing myris- 
toylation signal sequences derived from c-Src, MARCKs, hVPS15 and GRASP65, 
cDNA sequences encoding 10 N-terminal amino acids (see Fig. 4a) were cloned 
into modified pcDNA4T/O vector carrying C-terminal GFP and two repeats of 
Strep-tag. All site-directed mutations were generated using the QuickChange Site- 
Directed Mutagenesis Kit (Stratagene). More primer and plasmid information is 
available upon request. 

Bacterial infection of cultured cells. For Shigella infections, HeLa cells or HeLa 
cells stably expressing a GFP tag attached to the N-terminal Golgi retention signal 
of N-acetylglucosaminyltransferase I?’ (NAGFP HeLa) were seeded onto cover- 
slips in six-well dishes the day before infection. Sodium butyrate at 5 1M was 
added to NAGFP cells after seeding to stimulate production of Golgi-localized 
GFP marker*'. Shigella strains were inoculated from frozen stocks and grown 
overnight at 30°C in brain-heart infusion media (BHI) (Difco!™, BD Bio- 
sciences). Bacteria were then back-diluted 1:50 and incubated at 37 °C until reach- 
ing Dgoo ~ 0.5-0.6. Bacteria were then washed in 1 X phosphate buffer saline (PBS) 
and incubated at 37°C for 15 min in 0.003% Congo red. Twenty microlitres of 
bacterial suspension were then added to each well of a six-well dish of semi- 
confluent HeLa or NAGFP cells and centrifuged for 10 min at room temperature 
(1000g) to facilitate bacterial adherence. The plates were then incubated for 90 min 
at 37 °C, 5% CO . The media was removed and the wells were washed three times 
with gentamicin (150 jig ml *) followed by three washes with sterile PBS. Fresh 
antibiotic-free DMEM was added to each well after that and cells were incubated 
for an extra 2-4 h (37 °C, 5% CO;). After incubation, the slides were fixed in 3.7% 
formaldehyde and prepared for microscopy. Golgi and F-actin were visualized 
with anti-GM130 (BD Transduction Laboratories) and rhodamine-phalloidin 
(Molecular Probes) respectively. For infections with S. flexneri knockout strains 
complimented with IpaJ (wild type and C/H/D mutants), overnight bacterial 
cultures were diluted 1:50 in BHI containing 2% arabinose and incubated until 
reaching Deoo ~ 0.5-0.6 as described above. Infections were performed in low- 
glucose media containing 2% arabinose to allow expression of IpaJ from arabinose- 
inducible promoter. For infections with S. flexneri strains expressing mCherry, 
ampicillin was added both to BHI and DMEM culture media. 

A protocol for Salmonella infection of HeLa cells was adapted from previous 
studies’. S. typhimurium was grown overnight at 37°C in a glass flask with 
shaking then subcultured (1:30) and grown for 3h. One millilitre was pelleted, 
suspended in 1X PBS and added dropwise at various concentrations to semi- 
confluent HeLa cells in low-glucose DMEM+10% FBS. Cells were incubated 
(37 °C, 5% CO) for 10 min, washed three times with PBS and incubated for an 
extra 15 min (37 °C, 5% COz) in fresh low-glucose DMEM + 10% FBS. Cells were 


washed again with PBS and incubated in DMEM + 10% FBS + 100mgml ! 
gentamicin before fixing with 3.7% formaldehyde at various time points. Cellular 
phenotypes were visualized as described. 

L. monocytogenes was grown and prepared for HeLa infection as described”’. 
Various concentrations of bacteria were added to semi-confluent HeLa cells in 
DMEM + 10% FBS from an overnight culture of Listeria grown in BHI media. 
Cells were centrifuged (250g, room temperature, 5 min) and incubated (37 °C, 5% 
CO.) for 10 min, then washed three times with 1 PBS. Fresh media containing 
10mg ml ’ gentamicin were added and cells incubated for an extra 4-6 h before 
fixation and visualization. 

Gene disruption in Shigella. The virA, ipaJ and mxiD genes were individually 
disrupted using the A red recombinase-mediated recombination system”. Briefly, 
a kanamycin resistance cassette flanked with 50 base pairs homologous to the gene 
of interest (virA, ipaJ or mxiD) was amplified from plasmid DNA (pKD3) (primer 
sequences are available upon request). PCR products were electroporated into 
S. flexneri strain M90T carrying the red recombinase plasmid pKD46. Trans- 
formants were selected by growth on Luria broth agar plates containing kana- 
mycin (50gml~') and simultaneously cured of pKD46 by growth at 42°C 
overnight. The kanamycin resistance gene was eliminated through the introduc- 
tion of the pCP20 helper plasmid that contained the FLP recombinase. Subsequent 
curing of pCP20 was done by growing strains at 42 °C for 5h. Disruption of the 
virA and ipa] genes was confirmed through DNA sequencing of the respective 
genetic loci. To generate the double strain (Shigella AvirA/AipajJ), the ipa] locus 
was disrupted from a Shigella AvirA strain following the protocol described above. 
Shigella infection studies in mice. Female C57B1/6 mice were purchased from the 
National Cancer Institute and used between 6 and 8 weeks of age. For infection, 
S. flexneri M90T or mutants were grown and back-diluted from the stationary to 
early log phase growth (Dgo9 ~ 0.1) in BHI media (BD Biosciences) at 37 °C, 
washed and diluted in sterile saline, and administered dropwise (10° c.f.u. in 
30 pl volume) into the external nares of mice anaesthetized with ketamine/xylazine 
as described’. At the indicated time points after infection, mice were euthanized, 
and the recoverable bacterial c.fu. enumerated by plating serial dilutions of the 
lung homogenate on BHI plates. Statistical significance was determined by a one- 
way analysis of variance (P < 0.01) Tukey post hoc test. The amounts of the TNF-a 
and IL-6 in the serum and lung homogenate were enumerated using commercially 
available enzyme-linked immunosorbent assay (ELISA) reagents (R&D Systems). 
All experiments were performed under University of Minnesota or Cincinnati 
Children’s Hospital Institutional Animal Care and Use Committee approved 
protocols (S.S.W.). 

hGH trafficking assay. As previously described”**, HeLa cells (50% confluence) 
were transfected with 1 ug of 4 FKBP-hGH (Ariad Pharmaceutical; source of 
material, D. Bernstein) and either 0.5 ug eGFP-IpaJ, eGFP-VirA or pEGFP con- 
trol plasmid with Fugene6 (Roche). Sixteen hours later, the medium was replaced 
with medium containing AP21998 (final concentration 2 mM) or vehicle control. 
AP21998 was incubated with the cells for 2 h before the supernatant was collected. 
The supernatant was then diluted 100-fold and compared against an hGH 
standard curve (12.5-400 pg ml *) for the quantification of hGH released using 
an hGH enzyme-linked ELISA (Roche). For no drug controls, 100% ethanol (2 ll) 
was incubated with the cells for 2h. 

IpaJ bioinformatics. HHpred was used to detect known Pfam domains (profile 
database search used: pfam version 25.0) with distant structural homology rela- 
tionships to full-length IpaJ (GenInfo Identifier (GI): 12329066) with default 
parameter settings®’. The DUF3335 (C39-like peptidase family) gave a probability 
score of 90.0. One hundred proteins in the Pfam database contain this domain. 
PROMALS3D was used to produce a multiple-sequence alignment with the eight 
IpaJ and 100 DUF3335 family members to identify invariant catalytic residues and 
confirm conserved secondary and hydrophobicity patterns”. 

Recombinant protein expression and purification. Recombinant proteins 
(IpaJA50, ARFIAN17, GGA17¢6_15) were expressed in BL21-DE3 E. coli strains 
by induction with 0.4 mM IPTG for 16 hat 18 °C. Pellets were lysed with His buffer 
(100 mM HEPES, pH 7.5, 300 mM NaCl) or GST buffer (Tris buffer saline (TBS); 
50mM Tris pH7.5, 150mM NaCl, 2mM DTT) supplemented with protease 
cocktail (Roche). Proteins were purified with nickel agarose (Qiagen) or glutathione 
Sepharose (Amersham Biosciences) following the manufacturer’s instructions. 
Eluted proteins were buffer exchanged into TBS using concentration centrifugal 
columns (Millipore); glycerol was added to 15% and the proteins were then stored 
at —80°C. 

Cell transfections, microinjections and fluorescence microscopy. HEK293A, 
HeLa and NAGFP HeLa (see above) cells were transfected using FuGene6 (Roche) 
and incubated for 16-18h. For expression of Strep-tagged proteins, HEK293T 
cells were transfected using calcium phosphate and incubated for 18-24h. Equal 
amounts of DNA were used for co-transfection. Cells were then lysed (lysis buffer: 
20 mM Tris HCl pH 7.5, 1.5 mM MgCh, 350 mM NaCl, 0.5% NP-40, 5% glycerol) 
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and sonicated for 5s. The lysate was then clarified by centrifugation (15,000g, 
10 min) and applied to Strep-Tactin Superflow Plus resin (Qiagen). After incuba- 
tion (90 min, 4 °C) the column was washed three times (washing buffer: 20 mM 
Tris HCl pH 7.5, 1.5mM MgCl, 150mM NaCl, 0.2% NP-40, 5% glycerol) and 
Strep-tagged proteins were eluted with 2.5 mM desthiobiotin (100 mM Tris-HCl, 
150mM NaCl, 1mM EDTA, 2.5mM desthiobiotin, pH 8.0). Microinjections of 
IpaJA50 were performed using a semiautomatic InjectMan NI2 micromanipulator 
(Eppendorf). Recombinant proteins were diluted in 1x TBS with Cascade Blue 
(Invitrogen) fluorescent dyes until the final concentration of the protein of 1 mg 
ml | (21 1M). Cellular concentration of microinjected protein was estimated as 
1 uM. For IpaJ C64A and ARF1 co-localization studies, HeLa cells were treated 
either with Brefeldin A (2.5 ug ml ') for 20 min or with nocodazole (10 lug ml ') 
for 1h before fixation. Immunofluorescence images in all experiments were 
acquired with Zeiss AxioVert 200 fluorescence microscope. F-actin was visualized 
by rhodamine- or Alexa Fluor 350-conjugated phalloidin (Molecular Probes), 
Golgi was detected by GM130 antibodies (BD Biosciences) and secondary anti- 
mouse IgG antibodies (Thermo Scientific). NAGFP HeLa cells stimulated for 
production of GFP marker were additionally stained with anti-GFP antibodies 
(Clontech) and fluorescein-conjugated anti-rabbit IgG secondary antibodies 
(Thermo Scientific) to enhance the fluorescent signal. 

Galactose-induced yeast growth inhibition and yeast multicopy suppressor 
screen. Yeast Invscl strain was transformed with pYes-dest52 (Invitrogen) vector 
carrying ipa] gene (wild type or C/H/D catalytic mutant) under the GAL1 indu- 
cible promoter. Yeast was streaked onto Yc-U agar media containing glucose or 
galactose and rafinose as carbon source. Yeast was cultured for at 30 °C for 3-5 
days and the plates were visually inspected for growth. For the yeast multicopy 
suppressor screen, the ipaJ gene was stably integrated into Y7092 yeast strain 
genome (MAT« can1A::STE2pr-Sp_his5 his3A1leu2A0ura3A0met15A0 lyp1A) 
as previously described**. The resulting strain Y7092-IpaJ (MATa can1A::STE2pr- 
Sp_his5 his3A 1leu2A0ura3A0met 15A0 lyp1A trp1A::GAL1-IpaJ-URA3) carryied 
a single copy of the ipaJ gene under control of GAL] galactose-inducible promoter. 
Y7092-IpaJ cells were then transformed with the S. cerevisiae genomic library in 
the Yep13 vector (ATCC) and transformants were selected for survival on Yc-UL 
media containing 2% galactose and 1% raffinose as carbon sources. Positive clones 
were isolated and library vectors were sequenced. For the complementation assay, 
PCR-amplified yeast arfl and arf2 were expressed in yeast under GAPDH pro- 
moter (see above). 

Top-down mass spectrometry. Strep-tagged ARF1 protein was co-transfected 
with IpaJ or control vector. ARF1-strep was purified with a Strep-Tactin Superflow 
Plus column and eluted with 2.5mM desthiobiotin buffer (see above). Purified 
ARF 1-strep was used for LC-MS/MS analysis. 

LC-MS/MS was generally as previously described*’. Optima-grade solvents and 
acids (Thermo Scientific) were used. Reverse-phase liquid chromatography 
(RPLC) capillary columns were packed in-house to a length of 15cm with 5 pm 
diameter C18 Poroshell-300 resin (Agilent Technologies) in 75 jum inner diameter 
X 360m outer diameter Picofrit columns (New Objective) with 15 {um inner 
diameter integrated micro-electrospray tips. A capillary column heater (Analytical 
Sales & Services) was used to maintain the column temperature at 60 °C during 
analysis. RPLC mobile phase A included 0.025% TFA, 0.3% formic acid and 5% 
acetonitrile in water. RPLC mobile phase B included 0.025% TFA, 0.3% formic 
acid and 20% isopropanol in acetonitrile. The elution gradient was 0% B at 
0-3 min, 45% B at 3.01 min and 45-60% B from 3.01 to 23 min. Flow was regulated 
by an 1100-nano-LC (Agilent) at a flow rate of 0.5 pl min |. 

Before analysis, the protein was de-salted using a C4 ZipTip (Millipore) as 
described previously and suspended in 0.2% formic acid in water. Approxi- 
mately 1600 fmol of protein was loaded onto the capillary column per injection. 
Analysis was on an LTQ Orbitrap XLTM (Thermo Scientific) in full MS mode at 
60,000 Fourier-transform mass spectrometry resolution, two microscans, maxi- 
mum ion accumulation time of 500 ms, scan range from 800 to 2000 m/z, source- 
induced dissociation = 25 V, tube lens 100V, capillary voltage 50 V, capillary 
temperature 275°C. MS/MS data were collected with the source-induced 
dissociation = 60 V, tube lens 100 V, capillary voltage 50 V, capillary temperature 
375°C. The [M +6H]6+ charge state of ubiquitin was used to tune the mass 
spectrometer. The ‘Xtract’ function in the XcaliburTM (Thermo Scientific) data 
system with a signal:noise of 3 was used to extract intact protein and fragment 
masses from the raw spectral data. All fragmentation data presented were the 
summation of scans across the entire protein elution profile. MS/MS data sets 
were interrogated with Prosight PC 2.0TM (Thermo Scientific). Analysis of the 
MS/MS data against the known ARF1 sequences without the correct modification 
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state led to few or no b-ion matches. Replicate analysis was performed on each 
sample and assigned fragment ions were manually validated in raw data. 
Labelling N-myristoylated proteins by click chemistry and in vitro cleavage 
reaction. E. coli BL-21 cells expressing ARF1-His with or without yeast NMTp 
were grown overnight at 37°C in LB media. Cultures were diluted 1:50 and 
incubated at 37 °C until reaching Dgo9 ~ 0.5-0.6. Myristic acid and azide myristic 
acid were added at the concentrations 50 tM and 5 1M respectively, and cells were 
additionally incubated for 30 min. Protein expression was then induced with 
0.4mM IPTG and cells were further incubated at 37 °C for 3 h. We estimated that 
N-myristoylated ARF1 is bound in a GDP:GTP ratio of 1:1 as assessed by GGA- 
binding assays. Expression of IpaJ or IpaJ C64A mutant was performed by the 
same method without adding myristic acid. ARF1- and IpaJ-expressing cells were 
lysed by sonication for 1 mn separately (lysis buffer: 20mM Tris HCl pH 7.5, 
1.5mM MgCl, 350mM NaCl, 0.5% NP-40, 5% glycerol) and the lysates were 
mixed together and incubated for 30 min at 37 °C. ARF1 was then purified using 
nickel agarose (Qiagen). N-myristoylated proteins were labelled with Alex Fluor 
647 Alkyne using Click-iT Reaction Buffer Kit (Invitrogen) according to instruc- 
tions with modifications. Specifically, click labelling was performed on the pro- 
teins still bound to the column. The column was then washed and proteins 
were eluted with 500mM imidazole/100 mM HEPES containing 1% SDS. The 
N-myristoylation status was analysed by in-gel fluorescence and the equal protein 
load was confirmed by Coomassie stain. For cleavage inhibition, free myristic acid 
or vehicle control (DMSO) was added to bacterial lysate containing ARF1 before 
adding IpaJ. After cleavage reaction, ARF1 was then labelled and purified as 
described above. To test the cleavage of non-myristoylated ARF1, a similar experi- 
mental setup was used with the following modifications: E. coli BL-21 cells expres- 
sing ARF1-His (in the absence of NMTp or exogenous myristic acid) were purified 
with Ni-NTA agarose, eluted with 500 mM imidazole/100 mM HEPES and pro- 
cessed for mass spectrometry. 

For metabolic labelling in mammalian cells, myristic acid-azide (Invitrogen) 
was added to the cell cultures 6h after transfection at final concentration 10 uM 
and incubation was continued overnight. The next day, Strep-tagged proteins were 
purified using Strep-Tactin SuperFlow Plus columns (see above). N-myristoylated 
proteins were labelled with Alex Fluor 647 Alkyne as described above. The column 
was then washed and proteins were eluted with 2.5mM desthiobiotin elution 
buffer. Purified proteins were separated on SDS-polyacrylamide gel electro- 
phoresis gel and transferred onto nitrocellulose paper (Biorad) for probing with 
STREPtactin-HRP (Biorad). Fluorescence was analysed both in SDS-gel and on 
nitrocellulose paper (collectively ‘in-gel fluorescence’). Expression of IpaJ-GFP 
(wild type or catalytic mutants) was confirmed by probing cell lysates with 
anti-GFP antibodies (Clontech) and secondary HRP-anti-rabbit IgG antibodies 
(Invitrogen). Expression of GRASP65- and hVPS15-GFP chimaeric proteins was 
probed in cell lysate, before protein purification. For in vitro cleavage reaction, 
ARF1-strep transfected cells labelled with myristic acid-azide were lysed and 3 1g 
of recombinant IpaJ A50 (wild type or C64A catalytic mutant) were added into 
20 ul of the lysate. After incubation at 37 °C for 30 min, myristoylated ARF1-strep 
was labelled and analysed as described above. For analysis of cleavage of multiple 
proteins, HEK293T cells were cultured in the presence of azide-modified moieties 
(myristic acid, palmitic acid, geranylgeranyl alcohol) according to the Click-iT 
protocol. Cell lysates were incubated with 3 j1g of IpaJ A50 (wild type or C64A) 
for 30min at 37°C. Proteins containing azide-modified moieties were then 
labelled with Alex Fluor 647 Alkyne and purified according to the Click-iT label- 
ling protocol. 
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SIRT6 regulates TNF-a secretion through hydrolysis 
of long-chain fatty acyl lysine 
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The Sir2 family of enzymes or sirtuins are known as nicotinamide 
adenine dinucleotide (NAD)-dependent deacetylases' and have 
been implicated in the regulation of transcription, genome stability, 
metabolism and lifespan”*. However, four of the seven mammalian 
sirtuins have very weak deacetylase activity in vitro. Here we show 
that human SIRT6 efficiently removes long-chain fatty acyl groups, 
such as myristoyl, from lysine residues. The crystal structure of 
SIRT6 reveals a large hydrophobic pocket that can accommodate 
long-chain fatty acyl groups. We demonstrate further that SIRT6 
promotes the secretion of tumour necrosis factor-a (TNF-a) by 
removing the fatty acyl modification on K19 and K20 of TNF-a. 
Protein lysine fatty acylation has been known to occur in mam- 
malian cells, but the function and regulatory mechanisms of this 
modification were unknown. Our data indicate that protein lysine 
fatty acylation is a novel mechanism that regulates protein secre- 
tion. The discovery of SIRT6 as an enzyme that controls protein 
lysine fatty acylation provides new opportunities to investigate the 
physiological function of a protein post-translational modification 
that has been little studied until now. 

It was recently demonstrated that Sirt5, one of the four sirtuins with 
weak deacetylase activity, preferentially hydrolyses succinyl and mal- 
onyl lysine**. The discovery of the novel Sirt5 activity suggested that 
other sirtuins with weak deacetylase activity may use alternative sub- 
strates as well. We therefore set out to investigate whether SIRT6 has 
any new enzymatic activity. SIRT6 has been reported to be important 
for DNA repair, transcriptional regulation of genes important for 
metabolism and immune responses, and for lifespan® *°. In most cases, 
the biological functions have been linked to the sequence-specific 
deacetylase activity of SIRT6 on histone H3K9 and K56, but not other 
peptide sequences”'’”. The deacetylase activity of SIRT6 on H3K9 and 
K56 can indeed be detected in our assay. However, the catalytic effi- 
ciency is low (Table 1), indicating that the deacetylase activity may 
only account for a part of its biological function. 

To discover possible novel activity of SIRT6, we synthesized H3K9 
peptides with different acyl groups (acetyl, malonyl, succinyl, butyryl, 
myristoyl and palmitoyl) and assayed these peptides with recombinant 
SIRT6. SIRT6 can hydrolyse long-chain fatty acyl groups efficiently 
(Fig. la). The k.at/Km for demyristoylation (1,400 s 'M ')is approxi- 
mately 300-fold better than that for deacetylation (4.8s 'M_ '). The 


Table 1 | Catalytic efficiencies of SIRT6 on different acyl peptides 


Acyl peptide Kea (S-*) Kr (uM) keat/ Km (S-* M7") 
H3K9 acetyl 0.0039+0.0006 810+160 48 
H3K9 butyryl 0.0021+0.0004 200+120 10 
H3K9 octanoy! 0.0046 + 0.0005 40+ 10 1.2 x 10? 
H3K9 myristoy! 0.0049+0.0004 34+09 1.4 x 108 
H3K9 palmitoy| 0.0027+0.0002 0.9+04 3.0 x 10° 
TNF-« K19 myristoyl 0.0020+0.0002 24+0.6 8.3 x 102 
TNF-« K20 myristoyl 0.0050+0.0004 45+1.1 1.1 x 10° 


increased catalytic efficiency comes mainly from the decrease in Kp. 
For deacetylation, the K,,, is 810 |1M, but for demyristoylation, the K,, 
is 3.4 UM. The activity of SIRT6 (a class IV sirtuin) is similar to that of 
PfSir2A°, which was classified as a Class III sirtuin'* but lacks the 
conserved Arg and Tyr residues that recognize negatively charged 
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Figure 1 | SIRT6 preferentially hydrolyses long-chain fatty acyl lysine in 
vitro. a, High-performance liquid chromatography traces showing SIRT6- 
catalysed hydrolysis of different acyl peptides based on the H3K9 sequence. 
b, H2BK12 myristoyl peptide can be hydrolysed by SIRT6, whereas the 
corresponding acetyl peptide cannot. Reactions were carried out with 50 1M 
peptide, 1 14M SIRT6, 20 mM Tris pH 8.0, 0.5 mM NAD and 1mM 
dithiothreitol (DTT) at 37 °C for 30 min. 
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succinyl or malonyl groups*. This emphasizes that it is important to 
examine the activity of each sirtuin experimentally as bioinformatics 
predictions may not be sufficient. 

The conclusion that SIRT6 is better at hydrolysing long-chain fatty 
acyl groups was also supported by its ability to hydrolyse myristoyl 
group from other peptide sequences. The deacetylase activity of SIRT6 
was reported to be sequence-specific. Only H3K9 and H3K56 acetyl 
peptides, but not other peptides, were reported to be deacetylated by 
SIRT6*'"”. To test whether the more efficient demyristoylation acti- 
vity allows SIRT6 to use more peptide substrates, we synthesized acetyl 
and myristoyl peptides based on the H2B K12 sequence. Consistent 
with previous reports”''’’, the hydrolysis of the acetyl peptide by 
SIRT6 was undetectable. In contrast, the hydrolysis of the correspond- 
ing myristoyl peptide could be readily detected (Fig. 1b). These results 
confirm the preference of SIRT6 for long-chain fatty acyl groups. 

To determine the structural basis for enhanced SIRT6 activity with 
fatty acyl peptides, we obtained a crystal structure of SIRT6 in com- 
plex with a H3K9 myristoyl peptide and ADP-ribose (ADPR) at 2.2 A 
resolution (PDB 3ZG6, Fig. 2a, Supplementary Fig. 1). The overall 
structure is similar to the published SIRT6 structure’ (PDB 3K35) 
(Supplementary Fig. 1). Residues 2-10 and 166-174 of SIRT6 are 
visible in the H3K9 myristoyl bound structure, whereas the corres- 
ponding regions are missing in the published SIRT6 structure without 
peptide bound" (Supplementary Fig. 2). The way that H3K9 myristoyl 
and ADPR bind to SIRT6 is similar to that seen in other ternary 
complex structures of other sirtuins, such as Sir2Tm in complex with 
the p53 acetyl peptide and NAD (Supplementary Fig. 3). The peptide 
interacts with SIRT6 through many hydrogen-bonding interactions. 
Similar to what was observed with other sirtuins, including Sirt5, most 
of the hydrogen bonds come from main chain C=O and N-H of the 
myristoyl peptide, with the only side chain hydrogen-bonding coming 
from Trp 11 (Supplementary Fig. 4). Therefore, the selectivity for pep- 
tide sequences is not high, which is consistent with our enzymology 
data. The myristoyl group is located in a hydrophobic pocket (Fig. 2b) 
formed by hydrophobic residues from several flexible loops, including 
Ala 11, Pro 60, Phe 62, Trp 69, Pro 78, Phe 80, Phe 84, Val 113, Leu 130, 
Leu 184 and Ile 217. The substrate-binding sites in other available 
human sirtuin structures do not possess such a big hydrophobic pocket 
(Supplementary Fig. 5). The structural data are thus consistent with 
the biochemical data and provide a reasonable explanation for the 
preference of SIRT6 for long-chain fatty acyl groups. 

The next important question was whether this activity is physio- 
logically relevant. There are a few proteins known to be modified by 
fatty acyl groups on lysine, although the function of the modification is 
unclear’®’’. One of the fatty acylated proteins is TNF-«"°, a type II 
membrane protein with a single transmembrane domain linking the 
amino-terminal intracellular domain and the carboxy-terminal extra- 
cellular domain’*. When cleaved by proteases, the extracellular domain 
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is released. The released TNF-« can then bind its receptors and induce 
various signalling pathways’®. It was reported that SIRT6 can regulate 
the synthesis of TNF-«, but the mechanism was unclear’’”®. We pro- 
pose that SIRT6 may regulate TNF-« secretion via defatty-acylation. 

To test this, we measured the fatty acylation level on TNF-« in 
SIRT6 wild-type (WT) and knockout (KO) mouse embryonic fibro- 
blast (MEF) cells. Flag-tagged TNF-« was transfected into the cells. 
The cells were cultured in the presence of an alkyne-tagged fatty acid 
analogue, Alk14, which can covalently label fatty acylated proteins”. 
TNF-o was immunoprecipitated and conjugated to rhodamine-azide 
(Rh-N;) using click chemistry (Fig. 3a). A protein will be fluorescently 
labelled if it is fatty acylated by Alk14. Because Alk14 can also label 
cysteine residues**”’, we treated TNF-o with hydroxylamine to remove 
cysteine fatty acylation (Fig. 3a). As shown in Fig. 3b, TNF-« from 
SIRT6 KO MEF cells had significantly increased Alk14 labelling com- 
pared to TNF-« from SIRT6 WT MEF cells, indicating that SIRT6 
regulates the fatty acylation level on TNF-o%. When human SIRT6 WT 
was overexpressed in SIRT6 KO MEF cells, TNF-o had lower fatty 
acylation level than TNF-a from cells without overexpression of 
human SIRT6, whereas overexpression of human SIRT6(H133Y) cata- 
lytic mutant did not have much effect on TNF-a fatty acylation 
(Fig. 3c), indicating that enzymatic activity of SIRT6 is required for 
controlling TNF-« fatty acylation. 

To further confirm that fatty acylation occurs on lysine residues of 
TNF-a, we mutated K19 and K20, the reported sites of myristoylation 
on TNF-a'°, to Arg (labelled as ‘KR’ mutant in Fig. 3b). The labelling 
intensities of the mutant from SIRT6 KO MEF cells dropped to back- 
ground level (Fig. 3b), confirming that K19 and K20 are the major fatty 
acylation sites. Furthermore, TNF-« isolated from SIRT6 KO MEF 
cells can be defatty-acylated in vitro in an NAD-dependent manner 
(Fig. 3d). Synthetic TNF-« K19 and K20 myristoyl peptides can be 
efficiently hydrolysed by SIRT6 (Table 1). To rule out that TNF-o 
could also be regulated by lysine acetylation, we examined the acetyla- 
tion level using a pan-specific acetyl lysine antibody. No acetylation 
was detected on K19 and K20 of TNF-« (Supplementary Fig. 6). Thus, 
SIRT6 regulates the fatty acylation level, but not acetylation level on 
K19 and K20 of TNF-a. 

We next investigated the function of TNF-a fatty acylation on K19 
and K20. It was reported that the amount of TNF-« detected in the 
media of SIRT6 KO cells is less than that in the media of SIRT6 WT 
cells”. This was attributed to the regulation of TNF-« synthesis by 
SIRT6”. Given that SIRT6 regulates lysine fatty acylation, we propose 
that SIRT6 may regulate the secretion of TNF-«. To test this, Flag- 
tagged TNF-o was transiently transfected into SIRT6 WT and KO 
MEF cells. The amount of secreted TNF-« in the medium and the 
amount of TNF-« in the cells were measured by ELISA (Supplemen- 
tary Fig. 7). The percentage of secreted TNF-« was then calculated. 
TNF-« secretion efficiency was lower in SIRT6 KO MEF cells than in 
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Figure 2 | Structure basis for SIRT6 activity with long-chain fatty acyl groups. a, Overall structure of SIRT6 with myristoyl H3K9 (Myr-H3K9, green) peptide 
and ADP-ribose (ADPR, yellow) bound. b, Hydrophobic residues in SIRT6 that accommodate the myristoyl (Myr) group. 
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Figure 3 | SIRT6 regulates TNF-a. fatty acylation and secretion. a, Method 
of using Alk14 to detect TNF-« fatty acylation. b, SIRT6 controls TNF-o fatty 
acylation on K19 and K20. KR, TNF-o(K19R, K20R) mutant. c, H133 of SIRT6 
is required for TNF-o defatty-acylation. KI, knock-in. d, SIRT6 defatty-acylates 
TNF-z in vitro. e, SIRT6 regulates secretion of TNF-« in MEF cells. n = 6; 


SIRT6 WT MEF cells (Fig. 3E). The secretion efficiency is due to lysine 
fatty acylation because the secretion efficiency of TNF-a KR mutant 
was not affected by SIRT6 knockout. These data indicate that lysine 
fatty acylation regulates TNF-x secretion and SIRT6 promotes TNF-a 
secretion by defatty-acylation. 

We further investigated whether endogenous TNF-« are also regu- 
lated by SIRT6. For this purpose, we used both human THP-1 cells and 
bone-marrow-derived mouse macrophages. Two SIRT6 knockdown 
(KD) THP-1 cell lines and one control KD THP-1 cell line were gene- 
rated (Fig. 3f). TNF-o from SIRT6 KD cells contained more fatty 
acylation than TNF-« from the control KD cells (Fig. 3f). The amount 
of secreted TNF-« in the medium and the amount of TNF-« in the cells 
were measured (Supplementary Fig. 8) and the percentage of secreted 
TNF-« was calculated (Fig. 3g). The secretion efficiency of TNF-« was 
lower in SIRT6 KD THP-1 cells than in control KD cells, especially 
when the cells were supplemented with palmitic acid (Fig. 3g). Similar 
results were obtained for TNF-a in mouse macrophages. TNF-o in 
SIRT6 WT macrophages had lower lysine fatty acylation than in SIRT6 
KO macrophages (Fig. 3h), whereas TNF-« secretion efficiency in 
SIRT6 WT macrophages was higher than in SIRT6 KO macrophages 
(Fig. 3i). 

SIRT6 was reported to be mainly localized in the nucleus’. The 
regulation of TNF-« secretion by SIRT6 indicated that SIRT6 might 
be present in secretory organelles, such as the endoplasmic reticulum. 
We indeed detected SIRT6 in the endoplasmic reticulum fraction of 
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***D <0).001. f-i, SIRT6 regulates fatty acylation level and secretion of 
endogenous TNF-« in THP-1 cells (f and g; n = 3; **P < 0.01) and bone- 
marrow-derived macrophages (h and i; n = 5; *P < 0.02). Secretion data were 
expressed as mean ~ s.d. IP, immunoprecipitation; shRNA, short hairpin 
RNA; WB, western blot. 


MEF and THP-1 cell lysates (Supplementary Fig. 9). In SIRT6 KD 
THP-1 cells or SIRT6 KO MEF cells, less or no SIRT6 was detected 
in the endoplasmic reticulum fraction compared with SIRT6 WT cells 
(Supplementary Fig. 9). The presence of SIRT6 in the endoplasmic 
reticulum provided further support for the conclusion that SIRT6 
regulates TNF-« secretion through defatty-acylation of TNF-«. 

In summary, our enzymological and structural studies show that 
human SIRT6, which has weak deacetylase activities in vitro, catalyses 
the hydrolysis of fatty acyl lysine modifications more efficiently than 
deacetylation. SIRT6 regulates the fatty acylation level on K19 and K20 
of TNF-«% and this modulates the secretion of TNF-«. It has been 
reported that SIRT6 regulates the acetylation level on histone H3K9 
and K56”"*”*, Our results are not in conflict with these reports because 
it is possible that the association with chromatin can regulate the 
activity of SIRT6. Alternatively, fatty acylation of lysine residues 
may occur to histones’, and SIRT6 can regulate both acetylation 
and fatty acylation of histones. The significance of our finding is 
several fold. First, it reveals a novel physiological activity for SIRT6. 
Second, it demonstrates, as far as we know for the first time, that lysine 
fatty acylation, a protein post-translational modification that has been 
little studied until now, has an important role in regulating protein 
secretion. Other proteins (Supplementary Fig. 10), such as insulin-like 
growth factor 1 (IGF1) may be regulated by similar mechanisms*™. 
Third, the discovery of SIRT6 activity on protein lysine fatty acyla- 
tion provides an avenue to further investigate this under-recognized 
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protein modification’®’? and may reveal interesting connections 


to the well-known cysteine palmitoylation and N-terminal glycine 
myristoylation”. 


METHODS SUMMARY 


Expression, purification and crystallization. Human SIRT6 was overexpressed 
in Escherichia coli and purified using Ni-NTA affinity chromatography, ion 
exchange and gel filtration. Crystals were grown from 12% PEG6K, 0.1 M MES 
(2-(N-morpholino)ethanesulphonic acid), pH 6.5. 

Data collection and structure determination. Data were collected at the 
Shanghai Synchrotron Radiation Facility (SSRF). The structure was determined 
by molecular replacement. 

Activity assay of SIRT6 on different acyl peptides. Acyl peptides were synthe- 
sized as previously described’. The hydrolysis of different acyl peptides catalysed 
by SIRT6 was monitored by high-performance liquid chromatography. Kinetic 
parameters were determined by varying the concentrations of the acyl peptides. 
Detection of long-chain fatty acylation on TNF-a. Flag-tagged TNF-o was 
transiently transfected into either SIRT6 WT or KO MEF cells’. Endogenous 
TNF-« in THP-1 cells and SIRT6 WT or KO mouse macrophages was stimulated 
by LPS. The fatty acyl lysine modification was detected using alkyne-tagged fatty 
acid analogue Alk14 using a procedure modified from a previously published 
method”. 


Full Methods and any associated references are available in the online version of 
the paper. 
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METHODS 

Reagents. Mouse monoclonal anti-Flag M2 antibody conjugated with horseradish 
peroxidase, anti-Flag M2 affinity gel, and human/mouse SIRT6 antibody (S4322) 
were from Sigma. Human TNF-« antibody (D5G9) and mouse TNF-« antibody 
(D2D4) were from Cell Signaling Technology. TNF-« Affibody immobilized on 
agarose (ab31909) and human SIRT6 antibody (ab88494) were from Abcam. 
Lamin A/C antibody (636), B-actin antibody (C4) and GRP 78 antibody (H-129) 
were from Santa Cruz Biotechnology. The rabbit pan-specific anti-acetyl lysine 
antibody was from ImmuneChem Pharmaceuticals. The goat anti-rabbit and 
mouse IgG conjugated with horseradish peroxidase and protein A/G plus-agarose 
were from Santa Cruz Biotechnology. Human and mouse TNF-o ELISA kits were 
from eBioscience. Brefeldin A (BFA), phorbol 12-myristate 13-acetate (PMA), 
lipopolysaccharides from Escherichia coli 0111:B4 (LPS) and palmitic acid were 
purchased from Sigma. Alk14 was synthesized according to reported procedures”. 
SIRT6 WT or KO MEF cells were generated as previously reported’. THP-1 cells 
were purchased from ATCC. 

Cloning, expression and purification of full-length SIRT6 for activity assay. 
The open reading frame of full-length human SIRT6 (1-355) was inserted into a 
pET28a vector between the BamHI and NotI sites. This plasmid was transformed 
into E. coli Arctic Express (DE3) cells. The cells were cultured at 37 °C in 2 X YT 
culture medium (5 g of NaCl, 16g of bactotrypton and 10g of yeast extract per 
litre). Isopropyl-B-p-1-thiogalactopyranoside (IPTG, 0.2 mM) was used to induce 
expression when attenuance Dgo0 nm was 0.6, and the culture was grown for 20h 
at 289K. Cells were collected by centrifugation at 7,330g for 10 min and then 
re-suspended in lysis buffer (20mM Tris-HCl, pH 7.2, 500mM NaCl and 2% 
glycerol). Cells were lysed using a cell disrupter. After centrifugation at 29,300g 
for 25min at 277K, the supernatant was loaded onto a nickel column (QFF- 
Sepharose, Amersham Biosciences) pre-equilibrated with 20mM_ Tris-HCl 
pH7.2 with 500mM NaCl. The protein was eluted with a linear gradient of 
imidazole (0-500 mM). The desired fractions were pooled, concentrated and buf- 
fer exchanged to cation exchange buffer (20mM Tris pH 7.2, 80mM NaCl, 5% 
glycerol). The protein was loaded onto a cation exchange column (Amersham 
Biosciences) and eluted with 1 M NaCl, 20 mM Tris-HCl, pH 7.2, 2% glycerol. The 
purified protein was stored at —80 °C. 

Cloning, expression and purification of truncated SIRT6 for crystallization. 
Human SIRT6 (1-294) was inserted into a pET28a vector between the NdeI and 
Not! sites. The protein was overproduced at 37 °C in E. coli Rosetta (DE3) strain 
using 2 X YT medium. The expression and purification method was the same as 
that used for the full-length SIRT6 except that after Ni column purification the 
protein was further purified by gel filtration using a Superdex75 column 
(Amersham Biosciences). The protein was eluted with 20 mM Tris-HCl, pH7.0, 
100mM NaCl. After concentration to 8 mg ml’, the target protein was frozen at 
—80°C. 

Synthesis of acyl peptides. The peptides were synthesized and purified as 
described previously’ The identity of the peptides was confirmed using LCQ 
Fleet ThermoFisher Mass Spectrometer. The acetyl, butyryl peptides were dis- 
solved in 25% (v/v) DMSO in water. The longer chain fatty acyl peptides were 
dissolved in DMSO. The concentrations of peptides were determined at 280 nm 
using extinctions coefficient of the two tryptophans attached at the C termini of the 
peptides. 

Deacylation activity assay. The activity of SIRT6 was analysed using reverse 
phase high-performance liquid chromatography on a Kinetex XB-C18 column 
(100A, 75 mm X 4.60 mm, 2.6 jum, Phenomenex). SIRT6 full length (1 4M) was 
incubated in a reaction mixture (60 ul) containing 20 mM Tris pH 8.0, 1 mM DTT, 
0.5mM NAD and 50 uM H3K9 acyl peptides at 37 °C for 30 min. Total DMSO 
content in the reaction was <2.5% unless mentioned otherwise. The reactions 
were quenched with 60 ul of 0.5N HCl in methanol. The reactions were then 
monitored by high-performance liquid chromatography as described later in 
the kinetics assay. 

Kinetics assay for acetyl and butyryl peptides. Peptide concentrations were 
varied from 0 to 25011M for H3K9 butyryl peptide and from 0 to 600 uM for 
the acetyl peptide. The reaction mixtures (60 ul with 2mM NAD, 1mM DTT, 
20 mM Tris, pH 8.0, 4 1M SIRT6 full length, and acyl peptides at various concen- 
trations) were incubated for 30 min at 37 °C.The reaction was stopped using 60 pl 
of 0.5 N HCl in methanol. The reaction mixtures were spun at 18,000g for 10 min 
and were analysed on a Kinetex XB-C18 column (100A, 100mm X 4.60 mm, 
2.6 jm, Phenomenex). The gradient of 20-40% B (acetonitrile with 0.1% TFA) 
in 17 min at 0.5 ml min | was used. 

Kinetics assay for long-chain fatty acyl peptides. The peptide concentrations 
were varied from 1 to 20 uM. The reactions (60 pl with 2mM NAD, 1mM DTT, 
20 mM Tris pH 8.0, 0.2 4M recombinant SIRT6 full length, and acyl peptides at 
different concentrations) was incubated for 15min at 37°C. The reaction was 
stopped using 60 pl of 0.5N HCl in methanol. The reaction mixtures were spun 


at 18,000g for 10 min and were analysed on a Kinetex XB-C18 column (100A, 
75mm X 4.60 mm, 2.6 jum, Phenomenex). The gradient of 0-55% B in 10 min at 
0.5 ml min ' was used. The product and the substrate peaks were quantified using 
absorbance at 280nm and converted to initial rates, which were then plotted 
against the acyl peptide concentrations and fitted using Kaleidagraph. 
Crystallization, X-ray data collection and structure determination. Crystals of 
complex SIRT6 with H3K9-Myr and ADPR were obtained by hanging drop 
vapour-diffusion method at 291K using commercial screens from Hampton 
Research. Each drop consisting of 1 pl of 10mg ml! protein complex solution 
(20 mM Tris-HCl, pH 7.4, 100 mM NaCl, 5mM DTT) and 1 wl reservoir solution 
was equilibrated against 400 il reservoir solution. The qualified crystals of SIRT6 
grew with a cube profile within 1 week with a reservoir containing 12% PEG6K, 
0.1 M MES, pH 6.5. The mixture of 30% glycerol with reservoir solution above was 
used as cryogenic liquor. The X-ray diffraction data were collected at 100K ina 
liquid nitrogen gas stream at the Shanghai Synchrotron Radiation Facility BL17U. 
180 frames were collected with a 1° oscillation and the data were indexed and 
integrated using the program HKL2000”°. The complex structure of SIRT6 with 
H3K9 myristoyl peptide and ADPR was solved by molecular replacement using 
the program Molrep from the CCP4 Suite’ with the published SIRT6 structure 
(PDB: 3K35)°** as the search model. Refinement and model building were per- 
formed with REFMAC5 and COOT from CCP4. The X-ray diffraction data col- 
lection and structure refinement statistics are shown in Supplementary Table 1. 
Generation of SIRT6 KO MEF cells with human SIRT6 WT and H133Y 
mutant knock-in. Human SIRT6 WT or H133Y complementary DNA was 
inserted to lentiviral vector (engineered from pSIN-EF2-OCT4 vector with 
OCT4 deletion to convert into gateway destination vector, and provided by C. 
Zhang) using Gateway Cloning. After co-transfection of SIRT6 WT or H133Y 
lentiviral plasmid, pCMV-dR8.2, and pMD2.G into 293T cells, the medium was 
collected to infect SIRT6 KO MEF cells. The SIRT6 WT or H133Y knock-in cells 
were selected using 1.5,1gml~' puromycin in complete cell culture medium. 
Overexpressed SIRT6 WT and H133Y mutant have a C-terminal V5 tag. 
Cloning of TNF-a. For construction of human TNF-« expression vector to 
express wild-type TNF-« protein (TNF-« WT) with N-terminal Flag tag and 
C-terminal haemagglutinin tag, human full-length TNF-o. cDNA were generated 
by PCR and inserted into pCMV4A vector at EcoRV and Xhol sites. The plasmid 
of TNF-o double lysine mutant (K19R, K20R), TNF-« KR, was made by overlap 
extension PCR. 

Transfection of TNF-a into MEF cells. SIRT6 WT and KO MEF cells were 
maintained in DMEM medium containing glucose and L-glutamine (Invitrogen) 
supplemented with 10% heat-inactivated fetal bovine serum (Invitrogen). The 
pCMV vectors containing target genes were transfected into cells using FuGene 
6 (Promega) according to the manufacturer’s protocol. Empty pCMV vector was 
transfected into cells as negative control. This transfection method was also applied 
to SIRT6 KO MEF cells with human SIRT6 WT or H133Y mutant knock-in for 
TNF-« overexpression. 

Labelling of TNF-a in MEF cells with Alk14. SIRT6 WT or KO MEF cells were 
cultured with fresh medium containing 20 1M Alk14 and 51g ml’ brefeldin A 
for 12h after transient transfection of TNF-a. Cells without Alk14 or without 
overexpression of TNF-« were used as negative controls. Cells were collected by 
cell scraper and resuspended in 1 PBS buffer. After centrifugation at 500g for 
5 min at 4°C, the cell pellet was dissolved in lysis buffer (25mM Tris, pH 7.4, 
150mM NaCl, 10% glycerol, 1% Nonidet P-40) with protease inhibitor cocktail 
(Sigma). The supernatant was collected after centrifugation at 16,000g for 10 min 
at 4°C, and used for western blotting, immunoprecipitation or TNF-« ELISA 
experiment. 

Generation of stable SIRT6 KD THP-1 cells. SIRT6 shRNA lentiviral plas- 
mids in pLKO.1-puro vector were purchased from Sigma. SIRT6 shRNA 1 
(TRCN0000378253) ccggcagtacgtccgagacacagtcctcgaggactgtgctcggacgtactgtttttg 
and shRNA 2 (TRCN0000232528) ccgggaagaatgtgccaagtgtaagctcgagcttacacttgg- 
cacattcttctttttg were used. After co-transfection of SIRT6 shRNA plasmid, 
pCMV-dR8.2 and pMD2.G into 293T cells, the medium was collected to infect 
THP-1 cells. The SIRT6 KD cells were selected using 1.5 4gml' puromycin in 
complete cell culture medium. THP-1 cells infected with lentivirus containing 
control shRNA plasmid were carried out similarly. 

Labelling of TNF-a in THP-1 cells with Alk14. THP-1 cells (with control 
shRNA, SIRT6 shRNA 1 orSIRT6 shRNA 2, at 0.6 X 10° cells ml!) were treated 
with PMA (200 ng ml ') for 24h. Then cells were cultured with fresh medium 
containing Alk14 (50 uM) and LPS (1 pg ml). After 1h, brefeldin A was added 
to culture medium (5 pg ml '). Cells were collected after 10h, and dissolved in 
lysis buffer (25 mM Tris, pH 7.4, 150 mM NaCl, 10% glycerol, 1% Nonidet P-40) 
with protease inhibitor cocktail (Sigma). The supernatant was collected after 
centrifugation at 16,000g for 10 min at 4°C and used for western blotting or 
immunoprecipitation. 
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Isolation of SIRT6 WT and KO macrophages and TNF-a. secretion. Bone- 
marrow-derived macrophages were isolated from SIRT6 WT and KO mice as 
described'* with some modifications. Briefly, femurs and tibiae were dissected 
and marrow tissue eluted by irrigation with DMEM. Cells were cultured in Petri 
dishes containing complete media (DMEM + 20% FCS + 10ng ml’ recombin- 
ant murine M-CSF + penicillin/streptomycin) for 7-10 days at 37 °C in a humidi- 
fied 5% CO atmosphere. For TNF-c mRNA expression analysis, 1.5 X 10° 
macrophages were plated in six-well plates containing complete media and incu- 
bated overnight. Next day, LPS (100 ngml”') was added and the cells were cul- 
tured for 12h. For TNF-« protein expression and secretion, cells were plated as 
above and, after 12 h of LPS stimulation, the supernatants were collected and used 
for ELISA analysis. To measure TNF-o fatty acylation levels, 1.5 < 10° macro- 
phages were plated in 6-cm plates in complete media and incubated overnight. 
Next day, cells were treated using the same procedure as described above for THP- 
1 cells. 

Western blot for TNF-a. Protein samples were separated by 18% SDS-PAGE and 
transferred to PVDF membrane. The membrane was blocked with 5% BSA in 
TBST (25mM Tris, pH7.4, 150mM NaCl, 0.1% Tween-20), incubated with 
antibodies in TBST, and developed in ECL Plus western blotting detection 
reagents (GE Healthcare). The chemiluminescence signal was recorded by 
Storm 860 Imager (Amersham Biosciences) and analysed with ImageQuant TL 
v2005. 

Immunoprecipitation of Flag-TNF-o using anti-Flag affinity gel. Cell lysate 
(250-400 1g from SIRT6 WT or KO MEE cells) was incubated with 20 11 suspension 
of anti-Flag M2 affinity gel for 2h at 4°C. After centrifugation at 500g for 2 min at 
4°C, the affinity gel was washed three times with 500 ll washing buffer (25 mM Tris, 
pH7.4, 150 mM NaCl, 0.2% Nonidet P-40) and used for later experiments. 
Immunoprecipitation of endogenous TNF-a from THP-1 cells. Cell lysate 
(~500 pg) was incubated with 20 pl suspension of TNF-« Affibody immobilized 
agarose for 4h at 4°C. After centrifugation at 500g for 2 min at 4°C, the agarose 
was washed three times with 500 pil washing buffer (25 mM Tris, pH 7.4, 150 mM 
NaCl, 0.2% Nonidet P-40) and used for later experiments. 
Immunoprecipitation of endogenous TNF-a from bone-marrow-derived 
mouse macrophage. Cell lysate (~100 tg from SIRT6 WT or KO macrophages) 
was incubated with 2 jig mouse TNF-o antibody for 2h at 4 °C. Then the mixture 
was incubated with 40 jl suspension of protein A/G plus-agarose for 8h at 4°C. 
After centrifugation at 500g for 2 min at 4 °C, the agarose was washed three times 
with 500 ul washing buffer (25mM Tris, pH7.4, 150mM NaCl, 0.2% Nonidet 
P-40) and used for later experiments. 

Detection of fatty acylation on TNF-a by fluorescence. After immunoprecipita- 
tion, the gel/agarose was re-suspended in 10 jl buffer (25 mM Tris, pH 7.4, 50 mM 
NaCl, 1% Nonidet P-40) for click chemistry. Rh-N3 (in DMF) was added to the 
above suspension to a final concentration of 200 1M, followed by the addition of 
Tris[(1-benzyl-1H-1,2,3-triazol-4-yl)methyl]amine (in DMF, final concentration 
600 uM), CuSO, (in water, final concentration 2 mM), and TCEP (in water, final 
concentration 2 mM). The click chemistry was allowed to proceed at room tem- 
perature for 60 min. The reaction mixture was mixed with protein loading buffer 
(final 2X) and heated at 95 °C for 10 min. After centrifugation at 16,000g for 2 min 
at room temperature, the supernatant was collected and heated with hydroxyla- 
mine (pH 7.2, 60mM) at 95°C for 7 min. The samples were resolved by SDS- 
PAGE using 18% acrylamide gel. Rhodamine fluorescence signal was recorded by 
Typhoon 9400 Variable Mode Imager (GE Healthcare Life Sciences) with setting 
of Green (532 nm)/580BP30 PMT 500 V (normal sensitivity), and analysed by 
ImageQuant TL v2005. 

Detection of acetyl-lysine on TNF-a by western blot using anti-acetyl lysine 
antibody. SIRT6 KO MEF cell lysate (~30 mg), with/without overexpression of 
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TNF-a WT or TNF-o KR, was incubated with 40 pil suspension of anti-Flag M2 
affinity gel for 2 hat 4°C. After centrifugation at 500g for 2 min at 4 °C, the affinity 
gel was washed three times with 500 pl washing buffer (25mM Tris, pH7.4, 
150mM NaCl, 0.2% Nonidet P-40) and then heated with 2 X protein loading 
buffer at 100°C for 10min. The samples were resolved by SDS-PAGE using 
18% acrylamide gel and examined by western-blotting using anti-acetyl-lysine 
antibody. Acetylated BSA was used for positive control to demonstrate acetyl- 
lysine signal by western-blotting. After recording acetyl-lysine signal, PVDF mem- 
brane was washed with water and stained with Coomassie blue to detect TNF-a 
protein. Another western-blotting using anti-Flag antibody was also carried out to 
demonstrate the equal loading of TNF-o, WT and TNF-« KR. 
Defatty-acylation of TNF-a by SIRT6 in vitro. TNF-« WT was immunopreci- 
pitated from lysate of SIRT6 KO MEF cells (~1 mg) overexpressing Flag~TNF-o 
and cultured with Alk14 using 40 pl suspension of anti-Flag M2 affinity gel fol- 
lowing procedure describe above. After washing, the gel was divided into four 
equal aliquots. Each aliquot was re-suspended in 10 pl assay buffer (50 mM Tris, 
pH7.4, 100 mM NaCl, 2mM MgCh, 1% Nonidet P-40) with 20 uM of SIRT6 or 
BSA and with or without 2mM NAD and incubated at 37°C for 1h. Then 
detection of fatty acylation by fluorescence after hydroxylamine treatment was 
carried out with each aliquot as described earlier. 

Secretion of TNF-a and TNF-a KR from SIRT6 WT and KO MEF Cells. After 
transient transfection of TNF-« WT or TNF-« KR, SIRT6 WT and KO MEF cells 
were further incubated with fresh medium for 12 h. Then culture medium and cells 
were collected separately for detection of TNF-« by using human TNF-o ELISA 
kit. Secretion efficiency was calculated by TNF-% amount in culture medium 
versus total amount of TNF-« in culture medium and cells. Six independent 
experiments were carried out. 

TNF-@ secretion from THP-1 cells. Stable shRNA-infected THP-1 cells (at 
0.6 X 10° cellsml') were treated with PMA (200 ng ml ') for 20h. Treatment 
without palmitic acid: cells were incubated with fresh medium containing LPS 
(100 ng ml © '). After 2 h, culture medium and cells were collected for human TNE- 
a ELISA assay. Treatment with palmitic acid: cells were incubated with palmitic 
acid (final 50 uM) in culture medium for 2 h, and then with fresh medium contain- 
ing LPS (100 ng ml ') and palmitic acid (50 1M). After 2h, culture medium and 
cells were collected for human TNF-« ELISA assay. Three independent experi- 
ments were carried out. 

Endoplasmic reticulum fractions from THP-1 cells and SIRT6 WT/KO MEF 
cells. Endoplasmic reticulum fraction was obtained from THP-1 cells, SIRT6 WT/ 
KO MEF cells, or SIRT6 KO MEF cells with human SIRT6 WT or H133Y knock- 
in, using an isopycnic flotation method”. Endoplasmic reticulum fraction was 
solubilized in lysis buffer (25 mM Tris, pH 7.4, 150 mM NaCl, 10% glycerol, 1% 
Nonidet P-40) with protease inhibitor cocktail (Sigma). The supernatant was 
collected after centrifugation at 16,000g for 10 min at 4°C, and used for western 
blotting. 

Statistical analysis. Data were expressed as mean + s.d. (standard deviation, 
shown as error bars). Differences were examined by two-tailed Student’s t-test 
between two groups; and *P < 0.02, **P < 0.01, ***P < 0.001. 
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(S)-2-hydroxypropylphosphonate ((S)-2-HPP) epoxidase (HppE) 
is a mononuclear non-haem-iron-dependent enzyme’ respons- 
ible for the final step in the biosynthesis of the clinically useful 
antibiotic fosfomycin*. Enzymes of this class typically catalyse oxy- 
genation reactions that proceed via the formation of substrate 
radical intermediates. By contrast, HppE catalyses an unusual 
dehydrogenation reaction while converting the secondary alcohol 
of (S)-2-HPP to the epoxide ring of fosfomycin’”. Here we show that 
HppE also catalyses a biologically unprecedented 1,2-phosphono 
migration with the alternative substrate (R)-1-HPP. This trans- 
formation probably involves an intermediary carbocation, based 
on observations with additional substrate analogues, such as (1R)- 
1-hydroxyl-2-aminopropylphosphonate, and model reactions for 
both radical- and carbocation-mediated migration. The ability of 
HppE to catalyse distinct reactions depending on the regio- and 
stereochemical properties of the substrate is given a structural basis 
using X-ray crystallography. These results provide compelling evi- 
dence for the formation of a substrate-derived cation intermediate 
in the catalytic cycle of a mononuclear non-haem-iron-dependent 
enzyme. The underlying chemistry of this unusual phosphono 
migration may represent a new paradigm for the in vivo cons- 
truction of phosphonate-containing natural products that can be 
exploited for the preparation of new phosphonate derivatives. 

Fosfomycin (1) is a clinically useful antibiotic for the treatment of 
limb-threatening diabetic foot infections and lower urinary tract infec- 
tions’. Its biological target is the bacterial enzyme MurA, which cata- 
lyses the first committed step in the biosynthesis of peptidoglycan, the 
main component of the bacterial cell wall®’. Notably, fosfomycin has 
been shown to be effective against ciprofloxacin-resistant Escherichia 
coli®, and methicillin- or vancomycin-resistant strains of Staphylococcus 
aureus”. Fosfomycin is derived from phosphoenolpyruvate (PEP, 2), 
with the phosphonate moiety being generated in a rearrangement reac- 
tion catalysed by PEP phosphomutase (Fom1) (2-3, Fig. 1a)'*”*. This 
is followed by a reaction sequence involving decarboxylation, reduc- 
tion and carbon methylation to produce (S)-2-HPP (4)’*™*. The final 
step of the pathway is the conversion of 4 to 1, catalysed by HppE’”. 

Early studies showed that HppE is a mononuclear non-haem-iron- 
dependent enzyme’” that uses a 2-His-1-carboxylate facial triad as the 
iron ligands, with its overall structural fold belonging to the cupin super- 
family*'*. However, unlike many other enzymes in this class", its activity 
is not dependent on o-ketoglutarate. Instead, HppE uses reducing equi- 
valents derived from NADH to activate molecular oxygen’”. In addition, 
isotope-labelling experiments demonstrated that the oxygen atom of the 
oxiranyl ring in 1 is not derived from O3, but instead from the secondary 
hydroxyl group of 4 (Fig. 1a)'°. Thus, the HppE-catalysed conversion 
of 4 to 1 is in fact a dehydrogenation reaction and not an oxygena- 
tion reaction. 


Previous experiments have also shown that the HppE reaction 
course is dependent on the substrate stereochemistry, as the HppE- 
catalysed dehydrogenation of (R)-2-HPP (5) produces the ketone 6 
rather than an epoxide (Fig. 1b)'”. Recent spectroscopic and crystal- 
lographic studies revealed that both enantiomers of 2-HPP (4 and 5) 
act as bidentate ligands to the mononuclear iron, such that only a single 
hydrogen atom is poised for abstraction by a reactive iron-oxygen 
species*'*°. The direct coordination of the negatively charged phos- 
phonate group to the iron centre probably helps to activate Fe(11) for 
reaction with O, and thus facilitates the formation of higher iron 
oxidation states for substrate oxidation**°”’. These findings prompted 
a more thorough examination of the substrate flexibility and reactivity 
of HppE. Towards this aim, both enantiomers of 1-hydroxypropylpho- 
sphonate (7, 1-HPP) were synthesized and used as mechanistic probes 
of the HppE-catalysed reaction (Supplementary Information). 
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Figure 1 | Fosfomycin biosynthetic pathway and HppE-catalysed 
conversion of various substrate analogues. a, Formation of fosfomycin 

(1) from PEP (2). The oxygen atom of the hydroxyl group of 4 is shown in red to 
highlight its retention in the fosfomycin product. Fom1, PEP phosphomutase; 
Fom2, phosphonopyruvate decarboxylase; FomC, phosphonoacetaldehyde 
dehydrogenase; Fom3, 2-hydroxyethylphosphonate C-methylase. 

b, Conversion of (R)-2-HPP (5) to the corresponding ketone 6. c, Conversion of 
(S)-1-HPP ((S)-7) to acyl phosphonate (8). d, Conversion of (R)-1-HPP ((R)- 
7) to the aldehyde product (9). The 13C-Jabelled carbons are indicated in red. 
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Figure 2 | 'H NMR time-course for the HppE-catalysed conversion of (S)-7 
to 8 (a) and (R)-7 to 9 (b). The peak at 6 2.49 is from deuterated 
dimethylsuphoxide (DMSO-dg), and those (in black) centred between 6 2.50 


Incubation of (S)-7 with HppE produced the acyl phosphonate 8 
(Figs 1c and 2a), a reaction analogous to the dehydrogenation of 5 to 
form the corresponding C2 ketone (6, Fig. 1b). Both reactions are 
consistent with hydrogen-atom abstraction from the oxygen-bearing 
carbon to yield an «-hydroxyalkyl radical that undergoes one-electron 
oxidation to form the corresponding oxo product. By contrast, when 
(R)-7 was treated with HppE, 1-oxopropan-2-ylphosphonate (9) was 
obtained as the sole product (Figs 1d and 2b). The structure of 9 was 
initially determined by NMR, and was verified after NaBH, reduction 
to the more stable product 1-(hydroxymethyl)ethylphosphonate (10) 
and comparison with the chemically synthesized standard”. To fur- 
ther validate the structure of 9, (R)-7 was prepared enriched with °C 
at Cl ((R)-[1-'°C]-7) and used as the substrate in the HppE reaction 
(Supplementary Information). Analysis by '*C NMR demonstrated 
that the large C1-P coupling constant of 150.0 Hz in (R)-[1-°C]-7 
was reduced to 2.7 Hz in 10 following reaction with HppE. These 
results indicate that HppE catalyses a 1,2-shift of the phosphono group 
during catalytic turnover of the substrate analogue (R)-7, for which 
there is, to our knowledge, no enzymatic precedent. 

The crystal structures of Fe(11)-HppE in complex with (R)-7 and 
(S)-7 were determined to 2.71 and 2.80 A resolution, respectively, in 
order to provide a structural basis for the altered stereo- and regios- 
pecificity of the HppE-catalysed reactions with these substrates 
(Supplementary Information). Our results show that (R)-7 and (S)-7 
bind to the iron centre in a bidentate fashion through an oxygen from 
the phosphonate moiety and the C1 hydroxyl (Fig. 3 and Supplemen- 
tary Figs 5 and 6). Three additional coordination sites of the iron are 
filled by the 2-His-1-carboxylate facial triad, leaving one site available 
for dioxygen binding. This site, partially occupied by a water molecule 
in these structures, is the same putative O,-binding pocket to which 
the O-mimic nitric oxide binds’. This spatial arrangement would 
direct the C1 hydrogen of (S)-7 and the pro-R C2 hydrogen of (R)-7 
towards the reactive iron—oxygen species, consistent with the observed 
reaction products 8 and 9. 
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Figure 3 | Structures of (S)-7 and (R)-7 bound to the iron centre of HppE in 
a bidentate mode. a, (S)-7 (carbons in light blue) in F, - F, omit map density 
contoured at 6 (left) and its chemical structure (right), with hydrogen atoms 
accessible for abstraction in red and inaccessible in blue. The putative dioxygen 
binding site on Fe is partially occupied by water molecules (red spheres) in both 
of these structures (Fe to H2O distances are 3.7 and 3.0 A). Colours: Fe in rust, P 
in orange, O in red, N in blue, C in grey. b, (R)-7 (carbons in green) in F, - F, 
omit map density contoured at 6o (left) and its chemical structure (right). 
Hydrogen atoms labelled as in a. 
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To verify the stereochemistry of hydrogen-atom abstraction from 
the prochiral carbon of (R)-7, substrate stereospecifically deuterated 
at the pro-R (11) or pro-S (12) positions of C2 were synthesized 
(Supplementary Fig. 2) and reacted with HppE. 'H NMR analysis of 
the methyl-group splitting pattern of the reaction products from both 
diastereomers (Supplementary Fig. 3) demonstrated retention of only 
the pro-S hydrogen, consistent with the stereospecific abstraction of 
the pro-R hydrogen atom from C2 of (R)-7 (Fig. 4a and Supplementary 
Information) predicted by crystal structural analysis (Fig. 3). 

Although non-enzymatic 1,2-phosphono migrations are known, 
they require either strong Lewis acids” or harsh alkaline conditions™ 
and are generally thought to proceed through carbocationic inter- 
mediates. It is therefore plausible that such an intermediate may also 
be involved in the HppE-catalysed 1,2-phosphono migration of (R)-7 
(Fig. 4b, route a). Following pro-R hydrogen-atom abstraction by a 
reactive iron—-oxygen species to form the C2-centred radical 14, elec- 
tron transfer to the iron centre would produce the corresponding C2 
carbocation 15. This oxidation would then trigger the 1,2-phosphono 
migration in direct analogy to the non-enzymatic reactions. However, 
an alternative route involving a C2 radical-mediated migration of the 
C-P bond to generate the ketyl radical 16 before electron transfer 
could not be excluded (Fig. 4b, route b). 

To gain insight into the chemistry of the defining intermediates of 
these two mechanistic hypotheses, model studies were carried out with 
compounds 17 and 18 (Supplementary Information). As summarized 
in Fig. 4c, when 17 was treated with silver triflate to generate the carbo- 
cation intermediate 19, the migration product 20 was produced. By 
contrast, when the alkyl radical 21 was generated, either by exposure of 
compound 18 to ultraviolet radiation or treatment of 17 with tribu- 
tyltin hydride, formation of alkene 22 was observed. These findings 
provide a correlation between formation of a carbocation intermediate 
and 1,2-phosphono migration. It is also worth noting that the inter- 
mediacy of carbocationic species has been implicated in most biological 
1,2-hydride and 1,2-alkyl shift reactions, such as those catalysed by 
various terpenoid cyclases” and the ‘NIH shift’ of aromatic amino acid 
hydroxylases**. However, the situation could still be different on the 
enzyme, as in the model reaction formation of 22 from 21 may simply 
be more rapid than rearrangement to form 20 in solution. 

Further evidence supporting a carbocationic intermediate in the 
HppE-catalysed reaction was obtained using the substrate analogues 
(1R, 2R)- and (1R, 2S)-1-hydroxyl-2-aminopropyl phosphonate 
((RR)- and (RS)-23; Fig. 4d and Supplementary Information). Using 
a NMR assay and authentic standards (Supplementary Fig. 4 and 
Supplementary Information), HppE was shown to convert both iso- 
mers of 23 to 26 with no detectable formation of the migration or other 
product. This ketone product can be envisioned as forming through 
hydrogen-atom abstraction from C2, yielding an %-aminoalkyl radical 
(24) that is oxidized to the stable C2 iminium ion 25, which can 
undergo spontaneous hydrolysis to yield 26. Hydrogen-atom abstrac- 
tion from C2 and the conversion of both isomers of 23 to 26 are 
consistent with bidentate coordination to the active site iron through 
the Cl hydroxyl and phosphonate moieties, in a manner analogous to 
(R)-7 (Supplementary Figs 7 and 8). Support for the hypothesis that 
the C2 amino group of 23 is unable to sustain the bidendate substrate 
coordination required for catalysis was obtained with the substrate 
analogue 2-aminopropylphosphonate ((+)-27; Fig. 4d), which lacks 
the 1-hydroxyl group of 23 and is not a substrate of HppE (Sup- 
plementary Information). Taken together, these results are consistent 
with a mechanism in which a reactive iron—-oxygen species generated 
during the catalytic cycle of HppE is capable of oxidizing the C2- 
centred radical 24 to the corresponding carbocation 25 during the 
conversion of 23 to 26 (Fig. 4d). 

In summary, this investigation provides new insight into the catalytic 
capability and chemical mechanism of the non-haem-iron enzyme 
HppE. HppE shows remarkable catalytic versatility, converting the (R) 
and (S) isomers of 1-HPP and 2-HPP to aldehyde, acyl phosphonate, 
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Figure 4 | Summary of HppE-catalysed reactions and model chemistry 
examined in this study. a, Stereochemistry of hydrogen-atom abstraction 
from (R)-1-HPP (7) determined using the stereospecifically deuterated 
compounds 11 and 12. The deuterium atom is highlighted in red. 

b, Hypothetical mechanisms for HppE-catalysed 1,2-phosphono migration 
involving cation (route a, red arrows)-mediated and radical (route b, blue 
arrows)-mediated rearrangements. Intermediate 14 could be generated in a 
manner analogous to the formation of 31 from (S)-2-HPP (see Fig. 5) via 
abstraction of the pro-R C2 hydrogen (highlighted in red). ¢, Model reactions to 
probe the involvement of radical or cation intermediates in the 1,2-phosphono 
migration catalysed by HppE. The migration product is only observed when 
cation 19 is formed from 17 using silver triflate. d, HppE-catalysed conversion 
of (RR)-23 and (RS)-23 to the imine hydrolysis product 26, consistent with the 
oxidation of (R)-7 to a cationic intermediate by a reactive iron—-oxygen species 
(red arrow). Box, compound 27 (which lacks the C1 hydroxyl group) is not a 
substrate of HppE, indicating that the amino group is not capable of supporting 
the bidentate substrate coordination required for catalysis. 


ketone and epoxide products, respectively (Fig. 1). This study reveals 
an unprecedented 1,2-phosphono-migration reaction and provides 
support for the existence of carbocation intermediates in the HppE 
reaction. The unique chemistry observed during the phosphono- 
migration reaction catalysed by HppE may represent a new paradigm 
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Figure 5 | Possible mechanisms for the HppE-catalysed epoxidation of (S)- 
2-HPP (4) involving C1 cation formation (route a, red arrow) or oxygen- 
atom rebound (route b, blue arrows). The oxygen atom of the hydroxyl group 
of 4 is shown in red to highlight its retention in the fosfomycin product. 


for the rearrangement of C-P bonds in nature, which can be exploited 
for the synthesis of new phosphonate-containing natural products. 
Moreover, the probable use of a carbocation intermediate in the 
HppE-catalysed conversion of 23 to 26 buoys the proposal of a carbo- 
cation (15) intermediate in the conversion of (R)-7 to 9 (phosphono- 
migration product in Fig. 1d), and prompts re-examination of the 
mechanism of HppE in general. In terms of the types of iron-oxygen 
species used by HppE, '*O kinetic isotope effect studies with the nat- 
ural substrate, (S)-2-HPP (4), demonstrate partially rate-limiting 
formation of a Fe(111)—-hydroperoxo intermediate (30)”’, which impli- 
cates Fe(111)-superoxo as the species that probably abstracts the hydro- 
gen atom from the substrate (29-30; Fig. 5)°*’°. At first glance, the 
C1-H bond of (S)-2-HPP (29 in Fig. 5) might seem more activated 
than the C2-H bond of (R)-7 (Fig. 4), with the latter bond cleavage 
requiring the use of a highly reactive Fe(Iv)-oxo species. However, 
density functional theory calculations indicate that the phosphonate 
moiety of the substrate provides significant /-stabilization to the C2- 
centred radical (14), such that the bond dissociation energy of the C2- 
H of (R)-1-HPP is actually ~2.7 kcal mol ' less than that of the C1-H 
of (S)-2-HPP (93.8 versus 96.5 kcal mol ') (Supplementary Informa- 
tion). Thus, a Fe(11)-superoxo species may also be used by HppE to 
effect hydrogen-atom abstraction from C2 of (R)-7, allowing the 
Fe(1v)-oxo intermediate generated in a subsequent step to be used 
for the oxidation of the C2 radical to the cationic intermediate 
(1415, Fig. 4b). This chemical logic could apply to HppE in its 
reaction with the natural substrate. With a Fe(1m)-superoxo intermedi- 
ate responsible for hydrogen-atom abstraction, an Fe(Iv)-oxo inter- 
mediate is available to catalyse the more challenging oxidation of a C1 
radical to a carbocation (3132, Fig. 5). Indeed, this may be a more 
common strategy than previously surmised, being used by the growing 
number of identified non-haem-iron-dependent enzymes that initiate 
substrate oxidation using iron-superoxo intermediates”. The forma- 
tion of the Fe(Iv)-oxo intermediate 31 requires the input of a single 
electron at the stage of a highly reactive substrate radical intermediate 
(30). The precise timing of this redox reaction may be accomplished by 
the transfer of an electron from the putative proton-coupled electron- 
transfer pathway of HppE, which comprises several tyrosine residues’. 
The resulting protein radical could then be quenched by the external 
reductant at a subsequent (and not necessarily precisely controlled) 
time to regenerate the active form of the enzyme. Experiments to 
characterize the catalytic mechanism of HppE further are in progress. 
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METHODS SUMMARY 


HppE used in this study was purified as described previously”*. All synthetic and 
enzymatic reaction products were characterized by NMR spectroscopy and/or 
high-resolution mass spectrometry. A typical NMR assay contained 0.25 mM 
HppE, 0.25 mM Fe(NH4)2(SO,4)3°6H20, 7.5 mM flavin mononucleotide (FMN), 
25 mM substrate ((R)-7, (S)-7, 11, 12, (RR)-23, (RS)-23 or 27) and 25 mM NADH 
in 700 pl 20 mM Tris buffer (pH 7.5). The conversion of (RR)-23/(RS)-23 to 26 by 
HppE was confirmed by spiking the reaction mixture with an authentic standard 
of 26. Further details regarding experimental procedures and density functional 
theory calculations are described in Supplementary Information. 
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A solution to release twisted DNA during chromosome 
replication by coupled DNA polymerases 


Isabel Kurth’, Roxana E. Georgescu' & Mike E. O’Donnell! 


Chromosomal replication machines contain coupled DNA poly- 
merases that simultaneously replicate the leading and lagging 
strands’. However, coupled replication presents a largely unrecog- 
nized topological problem. Because DNA polymerase must travel a 
helical path during synthesis, the physical connection between lead- 
ing- and lagging-strand polymerases causes the daughter strands to 
entwine, or produces extensive build-up of negative supercoils in 
the newly synthesized DNA”*. How DNA polymerases maintain 
their connection during coupled replication despite these topo- 
logical challenges is unknown. Here we examine the dynamics of 
the Escherichia coli replisome, using ensemble and single-molecule 
methods, and show that the replisome may solve the topological 
problem independent of topoisomerases. We find that the lagging- 
strand polymerase frequently releases from an Okazaki fragment 
before completion, leaving single-strand gaps behind. Dissociation 
of the polymerase does not result in loss from the replisome because 
of its contact with the leading-strand polymerase. This behaviour, 
referred to as ‘signal release’, had been thought to require a protein, 
possibly primase, to pry polymerase from incompletely extended 
DNA fragments*’. However, we observe that signal release is inde- 
pendent of primase and does not seem to require a protein trigger at 
all. Instead, the lagging-strand polymerase is simply less processive 
in the context of a replisome. Interestingly, when the lagging-strand 
polymerase is supplied with primed DNA in trans, uncoupling it 
from the fork, high processivity is restored. Hence, we propose that 
coupled polymerases introduce topological changes, possibly by 
accumulation of superhelical tension in the newly synthesized 
DNA, that cause lower processivity and transient lagging-strand 
polymerase dissociation from DNA. 

Coupled leading- and lagging-strand polymerases, as observed in 
phage and bacterial replisomes’*, have marked implications for the 
topology of the daughter DNA products**’. The topological problem 
is based on the a priori fact that polymerases generate a helical product, 
and thus either they must travel a helical path or the DNA product 
must turn behind them. For example, a rotating leading-strand poly- 
merase will take the attached lagging-strand polymerase and wind the 
DNA duplex 360° around the axis of the leading strand, forming 
precatenanes in the daughter helices (Fig. 1a, left)’. At the rate of E. coli 
replication (650 base pairs per second) rotating polymerases quickly 
result in an impossible tangle’. Alternatively, ifthe DNA products rotate 
instead of the polymerases, negative supercoils accumulate (Fig. 1a, 
right). Superhelical tension on the lagging strand may be relieved by 
rotation of single-stranded DNA (ssDNA), but ssDNA-binding (SSB) 
proteins form large superstructures, which probably constrain swivel 
motion”. The energy generated by only three or four supercoils’’ is 
sufficient to disrupt protein-protein and protein-DNA interactions’. 
Thus topological tension could disrupt the replisome, unless the ten- 
sion is periodically released. The supercoils produced by coupled 
polymerases behind the fork are negative supercoils, which can be 
resolved by Topo I and Topo III’. However, Topo III is non-essential, 
and the viability of Topo I mutants has been controversial'*’*. Hence, 


topoisomerases may participate, but are insufficient to remove nega- 
tive supercoils produced by coupled replisomes. 

Interestingly, single-molecule studies demonstrate highly processive 
DNA synthesis (>100 kilobases (kb)) in the absence of topoisome- 
rases, implying that replisomes possess an intrinsic solution to the 
problem of coupled replication’®”’. Indeed, it was originally proposed 
that the topological problem could be solved by transient release of one 
polymerase of a coupled replisome from DNA (Fig. 1b), enabling the 
negative supercoils on both strands to relax’. If the leading polymerase 
detaches from DNA, it will rebind the same primer terminus for con- 
tinued extension. If the lagging polymerase dissociates, it will reattach 
to the same Okazaki fragment and complete it provided a new primed 
site is not yet formed. During extension of longer Okazaki fragments a 
new priming event is more likely to occur; thus a dissociated lagging 
polymerase may rebind the new RNA primer, leaving the original 
Okazaki fragment incomplete (Fig. 1b). In fact incomplete Okazaki 
fragments have been observed in bacterial and phage systems’. 
Premature polymerase dissociation is referred to as ‘signal release’ 
because it has been presumed that the lagging-strand polymerase is 
‘signalled’ by a replisome component to dissociate and alleviate super- 
coil tension*”. 

To gain insight into the topological challenge of coupled DNA 
replication, we developed assays to study signal release. The E. coli 
replisome is assembled on a 5’ biotinylated rolling circle substrate 
and then attached to streptavidin beads (Fig. 2a). The substrate has 
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Figure 1 | The topological problem caused by coupled leading- and lagging- 
strand polymerases. The figure illustrates the topological problem only for the 
leading-strand polymerase. a, The two Pols (yellow) are coupled through the 
clamp loader (green). Pols travel a helical path as they synthesize DNA. As the 
leading Pol spins, the lagging Pol is dragged around this path, twisting the 
lagging strand around the leading strand (left). Alternatively, the DNA turns 
instead (right), generating negative supercoils as indicated by the arrow. In 
either case, the stress can be released by (b) transient dissociation of the leading 
Pol or (c) transient dissociation of the lagging Pol, which could re-bind the 
incomplete Okazaki fragment or a new RNA primer. 


Downstream Okazaki 
fragment 


Lagging-strand 
synthesis 


Vi primer 


The Rockefeller University, Howard Hughes Medical Institute, 1230 York Avenue, New York, New York 10065, USA. 


4 APRIL 2013 | VOL 496 | NATURE | 119 


©2013 Macmillan Publishers Limited. All rights reserved 


LETTER 


a b c d 


& 
Sag [o-**P] dT dA [o-P] dT dA Oligo (nM) 
Sti-+ - st s 


10 

8 

ee 
y10" A) 


Wash, initiate 
replication 


= a 


Lagging-strand 
[o-82P]GATP 


RNA ! 
primer i4 


0.57 


Lagging-strand 
template 
[o-82P]d TTP 


35" 


Stop 


[o-82P]d TTP fo-S*P]dTTP ss © 


Tat t mb ot He OF 


ALL \ 


a 
i=} 


S 
So 


Signal release (%) 


ey 
S 


LL 
012 3 4 5 


Okazaki fragment size (kb) 


S1 digestion 123 12 3 


Figure 2 | Signal release does not require primase and correlates with 
increasing Okazaki fragment length. a, The replisome, consisting of DNA 
helicase (blue), Pol III* (one clamp loader (green) that binds three Pols 
(yellow); only two Pols are shown for clarity) and §-clamps (red) is assembled 
ona 5’ biotinylated rolling circle DNA, then attached to beads. After unbound 
proteins are removed, replication is initiated by adding primase, B, SSB, ATP 
and o.-**P-labelled deoxyribonucleotide triphosphates. Reactions are quenched 
and treated with S1 to cleave gaps left by signal release, before analysis on 


only three nucleotides on either strand, allowing either leading or 
lagging strand to be labelled depending on which radioactive nucleo- 
tide is used. Replication is initiated after wash steps to remove unbound 
proteins, and ssDNA gaps left by signal release are detected by treating 
the a-**P-labelled deoxythymidine triphosphate ({u-**P]dTTP)-labelled 
lagging-strand template with S1 endonuclease to digest the gaps (Sup- 
plementary Fig. 1). If all Okazaki fragments are completely extended, 
the product will be S1-resistant. Conversely, 100% signal release will 
leave gaps between every Okazaki fragment, and S1 digestion will yield 
products of similar size to Okazaki fragments (Supplementary Fig. 2). 
An intermediate length of DNA will result if signal release is less 
than 100%. 

The S1 analysis shows an intermediate-sized [x-**P]dTTP-labelled 
DNA (Fig. 2b, lane 2) that migrates between the length of undigested 
DNA (lane 1) and the size of Okazaki fragments (lane 3). Size analysis 
of S1-treated products shows that about one-third of all Okazaki frag- 
ments are incomplete (Supplementary Fig. 3 and Methods). To val- 
idate the assay, several controls were performed (Supplementary Fig. 
4): (1) S1 digestion eliminates ssDNA produced in a reaction without 
primase (that is, no lagging-strand synthesis); (2) S1 does not cleave 
double-strand Okazaki fragments; (3) ssDNA gaps produced by signal 
release can be filled in by addition of DNA polymerase II (Pol II); (4) 
the presence of complete Okazaki fragments is confirmed using ligase. 

Earlier studies implied that primase triggers polymerase signal 
release*®. To test this hypothesis, we performed the reaction in the 
complete absence of primase, using exogenously added DNA primers 
to initiate Okazaki fragments. However, the result was unchanged 
(Fig. 2c). Studies in the T4 system suggest that a loaded clamp triggers 
polymerase release’. Although clamps cannot be omitted without 
negating lagging-strand synthesis altogether, titrations of the clamp 
(with or without clamp loader) do not influence signal release (Sup- 
plementary Fig. 5). Notably, the assays of Fig. 2 remove excess (unbound) 
polymerase-clamp loader (that is, Pol III*) and DnaB helicase, and 
show that primase, soluble polymerase, clamp loader and helicase 
are not required to trigger polymerase dissociation. 

We find that increasing Okazaki fragment size by lowering DNA 
primer concentration results in more frequent signal release (Fig. 2d, e 
and Supplementary Fig. 6). Okazaki fragments of 1 kb give only 20% 
signal release whereas 5 kb Okazaki fragments give 80% signal release. 
Hence, small Okazaki fragments are more likely to be completed than 
long Okazaki fragments, implying that the lagging-strand polymerase 
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alkaline gels. The newly synthesized leading strand (purple) is the template for 
lagging-strand synthesis (blue), initiated by RNA primers (red). b, Replication 
reactions using [o-**P]dTTP either before (lane 1) or after (lane 2) $1 analysis, 
or using [o-??P]dATP (lane 3). c, Reactions in the absence of primase, using 
800 nM DNA 20-base oligonucleotides to prime the lagging strand. d, Titration 
of DNA 20-base oligonucleotides into reactions. e, Plot of Okazaki fragment 
length versus percentage signal release. 


is not very processive. In contrast, studies on model lagging-strand 
templates show the polymerase has high intrinsic processivity and 
extends a primed 5.4kb X174 ssDNA without dissociating'*””. To 
exclude the possibility that this result was an artefact of using DNA 
primers, we altered Okazaki fragment size by changing the concentra- 
tion of primase”®, but the results were upheld (Supplementary Fig. 7). 
That signal release correlates with long Okazaki fragments was con- 
firmed using Pol II to fill in ssDNA gaps, demonstrating that ssDNA 
gaps are associated with long Okazaki fragments and yield an average 
gap size of 800 nucleotides (Supplementary Fig. 8). 

To test further for a protein circuit that lowers processivity by 
triggering polymerase release, we designed an assay to measure lag- 
ging-strand polymerase processivity directly. Because priming at a 
replication fork occurs randomly, producing heterogeneous-sized 
products, we attached a 5.4kb primed X174 circular ssDNA to the 
lagging-strand polymerase in trans before initiating replication 
(Fig. 3a). Considering that all replisomal proteins were present, pro- 
tein-induced signal release should still have occurred, and the length of 
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Figure 3 | Lagging-strand polymerase processivity is restored using a 
separate DNA molecule. a, The replisome is assembled on DNA as in the 
legend to Fig. 2a. Unbound proteins are washed away before addition of primed 
X1745.4kb ssDNA. b, Reactions are initiated in the presence (lanes 7-12 and 
19-24) or absence (lanes 1-6 and 13-18) of primase. Supernatants containing 
X174 replication products (lanes 1-12) are separated from bead-bound 
rolling circle products (lanes 13-24). 
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extension from the defined primed site would define the processivity of 
the lagging-strand polymerase. 

A time-course of replication showed a most surprising result 
(Fig. 3b). The 5.4kb X174 DNA was fully replicated by the lagging 
polymerase (Fig. 3b, lanes 1-6 and 13-18), in striking contrast to the 
low processivity during Okazaki fragment synthesis (that is, 80% dis- 
sociation by 5 kb; Fig. 2d, e). We obtained similar results in the pre- 
sence or absence of primase, confirming that neither primase nor 
primed sites trigger signal release (Fig. 3b, lanes 7-12 and 19-24). 
The main difference in reactions with or without the primed @X174 
DNA in trans is the connection of the two polymerases through the 
DNA, indicating that the connection of two polymerases to the same 
DNA molecule causes a decrease in lagging-strand polymerase pro- 
cessivity. One possible non-protein source is the topological strain 
produced in DNA owing to coupled replication (Fig. 1). Lagging- 
strand loops are an unlikely source, as bending of large DNA stretches 
requires very little energy’. To examine the dynamics of lagging- 
strand polymerase action, we turned to single-molecule studies to 
determine whether signal release is characteristic of all replisomes, 
and whether it takes place during long replication times. Replication 
is performed in a flow cell in which the replisome-DNA complex is 
attached to a lipid bilayer. Replicating DNA is held in place at a lipid 
diffusion barrier (Fig. 4a and Methods). Upon initiating replication, 
product DNA is observed using total internal reflection fluorescence 
microscopy and a fluorescent double-stranded DNA (dsDNA) inter- 
calator (see Fig. 4b). We initially performed experiments at 10 jl min’ ', 
which translates into an applied force of 0.23 pN on the DNA, far below 
that needed to disrupt most protein-protein and protein-DNA inter- 
actions’. After replication was complete, we increased the flow to 
100 pil min™* (1.45 pN) to stretch DNA, enabling examination of the 
product for ssDNA gaps, which appeared as regions with decreased 
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fluorescence intensity’® (Fig. 4b, c). The resolution of the CCD (charge- 
coupled device) camera was approximately 0.27 1m per pixel, the 
length of about 880 base pairs of dsDNA. Hence ssDNA gaps should 
have resulted in an uneven pixel density over the length of the molecule 
and facilitated a quantitative evaluation of the ssDNA/dsDNA ratio per 
pixel (Methods). All the molecules examined contained pixel intensity 
differences along their entire length (Fig. 4c and Supplementary Fig. 9). 
If the pixel intensity differences were indeed ssDNA, they should 
have been eliminated upon adding Pol III, which fills gaps, in the flow 
buffer, which was the case (Fig. 4d). Summation of the pixel intensities 
over 298.4 1m of DNA in nine different DNA molecules indicated 
13% ssDNA and 87% dsDNA (assuming SSB-bound ssDNA equals 
dsDNA length’ (Fig. 4e)). This result is within twofold that observed in 
ensemble studies (that is, Okazaki fragments average 2.3 kb, and give 
65% signal release at 10 nM primase; thus an 800-base-pair average gap 
size yields a value of 22.3% ssDNA and 77.7% dsDNA). Overall, these 
results demonstrate that signal release occurs over the length of long 
DNA products and is a characteristic of all replisomes. 

We also performed replication at a tenfold higher flow rate (100 pl 
min |; 1.45 pN) (Fig. 4f, g). Unexpectedly, the amount of ssDNA 
relative to dsDNA increased nearly twofold relative to the 10 pl min * 
flow rate (Fig. 4h), from 13% at 10 ul min | to 22% at 1001 min ' 
(Fig. 4e). An explanation for this difference is that when the lagging- 
strand polymerase undergoes a transient release cycle (signal release), 
the increased velocity of the flow carries away the ‘old’ 3’ terminus, 
making the polymerase more likely to associate with the next RNA 
primer synthesized at the fork (illustration in Fig. 4i) and resulting in 
more ssDNA gaps. 

Notably, previous studies of signal release that suggested a protein 
trigger are also consistent with polymerase release triggered by tor- 
sional constraints owing to coupled replication’. For example, in T4, 
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Figure 4 | Single-molecule total internal reflection fluorescence 
microscopy. a, The replisome is assembled on the rolling circle DNA and 
attached to a lipid bilayer. Replication is initiated upon flowing replication 
buffer at 10 ul min /.b, DNA curtain produced at 10 ll min |. c, Normalized 
line plot intensity of a representative DNA strand. Diagrams at the top illustrate 
dsDNA and ssDNA regions. d, Analysis as in c, but with Pol III* in the buffer 


flow. e, Quantification of line plot analysis at indicated flow rates (n = 9 = s.d. 
for 101 min! or n = 13 + s.d. for 100 1 min‘), f, DNA curtain at 

100 pl min), g, Line plot analysis as inc, at 100 pl min |.h, Histogram relating 
pixel intensity to dsDNA at 10 pl min’! (light blue) and 100 yl min? (dark 


blue). i, Scheme at 100 pl min |, 
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increasing the concentration of clamps may accelerate clamp assembly 
on new RNA primers and increase the probability that a released poly- 
merase binds the new RNA primed site’. In the T7 system, additional 
primase may enhance priming frequency providing more opportunity 
for a dissociated polymerase to bind a new site’. 

Negative supercoiling is required for a variety of DNA metabolic 
actions, including replication, recombination and transcription’*”’. 
Negative superhelicity is a property of genomic DNA in all organisms 
and seems constant over the entire genome”. Hence it is conceivable 
that coupled leading-/lagging-strand replication provides a method by 
which negative supercoils can be placed into DNA across the entire 
genome without topoisomerase action. 

Our data suggest that signal release is caused by DNA topology and 
thus may provide a topoisomerase-independent solution to the topo- 
logical problem incurred by coupled DNA polymerases during rep- 
lication. The results demonstrate that (1) signal release does not require 
primase, (2) the release process is stochastic implying an inherent low 
processivity of the lagging polymerase, (3) signal release requires con- 
nection of both polymerases to the same DNA and (4) addition of 
DNA in trans eliminates signal release in the presence of all replisomal 
proteins. Despite the correlation of premature signal release to the 
topological problem inherent in coupled polymerases, the problem 
of coupled polymerase action is not definitively solved in this paper 
and will require further studies. 


METHODS SUMMARY 


Reactions contained 100 fmol 5’-biotinylated 100-base oligonucleotide rolling 
circle DNA”, 60 1M deoxyadenosine triphosphate (dATP) and deoxyguanosine 
triphosphate (dGTP), 50 14M ATP-y-S, 4 pmol DnaBg, 2.5 pmol 2, and 0.5 pmol 
Pol III* in 20 il replication buffer. Reactions were incubated 5 min at 37 °C before 
immobilization on streptavidin-magnetic beads. Beads were washed three times in 
500 ul replication buffer containing 30 1M dATP, dGTP, 12 pmol f2 and 50 .M 
ATP-y-S before re-suspending in replication buffer containing 601M dCTP, 
60 uM dGTP and 2.5 pmol >. Replication was initiated by adding 0.5 mM ATP, 
480nM SSBy, 501M NTPs and either 604M dATP, 10uM [a-**P]dTTP, or 
60 uM dTTP, 10 uM [o-??P]dATP, and 200 nM primase or DNA 20mer (unless 
indicated otherwise) for 35 s. For S1 analysis, reactions were quenched, and beads 
washed twice in Buffer E and once in $1 buffer. Beads were re-suspended in 20 pl 
S1 buffer. Five-microlitre beads were incubated with 1 U S1 for 7.5 min in 15 pl. 
Single-molecule total internal reflection fluorescence microscopy was performed 
as described", and is further elaborated in Methods. 


Full Methods and any associated references are available in the online version of 
the paper. 


Received 11 May 2012; accepted 7 February 2013. 
Published online 27 March 2013. 


1. Kornberg, A. & Baker, T. A. DNA Replication 2nd edn (W. H. Freeman, 1992). 

2. Ullsperger, C. J., Volgodskii, A. V. & Cozzarelli, N. R. Unlinking of DNA by 
Topoisomerases during DNA Replication Vol. 9 (Springer, 1995). 

3. Hingorani, M. M. & O’Donnell, M. Sliding clamps: a (tailored fit. Curr. Biol. 10, 
R25-R29 (2000). 


122 | NATURE | VOL 496 | 4 APRIL 2013 


4. Wyman, C. & Botchan, M. DNA replication. A familiar ring to DNA polymerase 

processivity. Curr. Biol. 5, 334-337 (1995). 

5. Hamdan, S. M., Loparo, J. J., Takahashi, M., Richardson, C. C. & van Oijen, A. M. 

Dynamics of DNA replication loops reveal temporal control of lagging-strand 

synthesis. Nature 457, 336-339 (2009). 

6. Li, X.& Marians, K. J. Two distinct triggers for cycling of the lagging strand 

polymerase at the replication fork. J. Biol. Chem. 275, 34757-34765 (2000). 

7. Yang, J., Nelson, S. W. & Benkovic, S. J. The control mechanism for lagging strand 

polymerase recycling during bacteriophage T4 DNA replication. Mol. Cel/ 21, 

153-164 (2006). 

8. Johnson, A.& O'Donnell, M. Cellular DNA replicases: components and dynamics at 

the replication fork. Annu. Rev. Biochem. 74, 283-315 (2005). 

9. Postow, L., Peter, B. J. & Cozzarelli, N. R. Knot what we thought before: the twisted 

story of replication. Bioessays 21, 805-808 (1999). 

0. Chastain, P. D. Il, Makhov, A. M., Nossal, N. G. & Griffith, J. Architecture of the 

replication complex and DNA loops at the fork generated by the bacteriophage t4 

proteins. J. Biol. Chem. 278, 21276-21285 (2003). 

1. Vologodskii, A. V., Lukashin, A. V., Anshelevich, V. V. & Frank-Kamenetskii, M. D. 

Fluctuations in superhelical DNA. Nucleic Acids Res. 6, 967-982 (1979). 

2. Leu, F. P., Georgescu, R. & O'Donnell, M. Mechanism of the E. coli tau processivity 

switch during lagging-strand synthesis. Mol. Cel! 11, 315-327 (2003). 

3. Espeli,O.& Marians, K. J. Untangling intracellular DNA topology. Mol. Microbiol. 52, 

925-931 (2004). 

4. Stockum, A., Lloyd, R. G. & Rudolph, C. J. On the viability of Escherichia coli cells 

acking DNA topoisomerase |. BMC Microbiol. 12, 26 (2012). 

15. Stupina, V.A. & Wang, J. C. Viability of Escherichia coli topA mutants lacking DNA 

opoisomerase |. J. Biol. Chem. 280, 355-360 (2005). 

6. Yao, N. Y., Georgescu, R. E., Finkelstein, J. & O’Donnell, M. E. Single-molecule 
analysis reveals that the lagging strand increases replisome processivity but slows 
replication fork progression. Proc. Natl Acad. Sci. USA 106, 13236-13241 (2009). 

7. Tanner, N. A. et al. Real-time single-molecule observation of rolling-circle DNA 
replication. Nucleic Acids Res. 37, e27 (2009). 

8. O'Donnell, M. E. & Kornberg, A. Complete replication of templates by Escherichia 
coli DNA polymerase III holoenzyme. J. Biol. Chem. 260, 12884-12889 (1985). 

19. McHenry, C. S. DNA replicases from a bacterial perspective. Annu. Rev. Biochem. 

80, 403-436 (2011). 

20. Wu, C.A., Zechner, E.L., Reems, J. A. McHenry, C.S. & Marians, K. J. Coordinated 

eading- and lagging-strand synthesis at the Escherichia coli DNA replication fork. 

V. Primase action regulates the cycle of Okazaki fragment synthesis. J. Biol. Chem. 

267, 4074-4083 (1992). 

21. Marko, J.F.& Siggia, E.D. Stretching DNA. Macromolecules 28, 8759-8770 (1995). 

22. Dorman, C.J. DNA supercoiling and bacterial gene expression. Sci. Prog. 89, 

151-166 (2006). 

23. Travers, A. & Muskhelishvili, G. A common topology for bacterial and eukaryotic 

ranscription initiation? EMBO Rep. 8, 147-151 (2007). 

24. Benyajati, C. & Worcel, A. Isolation, characterization, and structure of the folded 

interphase genome of Drosophila melanogaster. Cell 9, 393-407 (1976). 

25. Mclnerney, P. & O'Donnell, M. Functional uncoupling of twin polymerases: 

mechanism of polymerase dissociation from a lagging-strand block. J. Biol. Chem. 
279, 21543-21551 (2004). 


Supplementary Information is available in the online version of the paper. 


Acknowledgements We thank L. Langston, A. Libchaber and B. Michel for suggestions 
onthe manuscript. We are also grateful to the National Institutes of Health (GM 38839) 
for supporting this work. 


Author Contributions |.K. and M.O.D. conceived the project, |.K. and R.G. performed 
experiments, and I.K., R.G. and M.O.D. designed the experiments, analysed data and 
wrote the paper. 


Author Information Reprints and permissions information is available at 
www.nature.com/reprints. The authors declare no competing financial interests. 
Readers are welcome to comment on the online version of the paper. Correspondence 
and requests for materials should be addressed to M.O.D. (odonnel@rockefeller.edu). 


©2013 Macmillan Publishers Limited. All rights reserved 


METHODS 

Reagents and proteins. Proteins were purified as described: a, ¢, y, t, 6, 6’, x Wb, 9; 
B78, SSB’, Pol III* ((we0)3355’ xy) and y complex (y355’x\y) were reconstituted 
as described”’”’. Solid support bead-based assays were performed using Pol III* 
containing an € mutant (D12A and E14A) that eliminated the 3’—5’ exonuclease 
activity to prevent removal of the 3’ dideoxyterminal nucleotide of the Trap DNA. 
Replication buffer was 20mM Tris-HC1 (pH 7.5), 4% glycerol, 0.1 mM EDTA, 
40 1g ml‘ BSA, 5mM DTT and 10mM MgOAc). S1 buffer was 40 mM NaOAc 
(pH 4.5), 300 mM NaCl and 2mM ZnSO,. 

Rolling circle replication reactions. Twenty microlitres of streptavidin-coupled 
magnetic beads (Invitrogen) were washed and equilibrated in replication buffer. 
Replisomes were assembled in 20 1] replication buffer (20 mM Tris-HC1 (pH 7.5), 
4% glycerol, 0.1 mM EDTA, 40 pg ml | BSA,5mM DTT, 10mM MgOAc;) con- 
taining 100 fmol synthetic 5’-biotinylated 100-base oligonucleotide rolling circle 
DNA (prepared as in”*), 60 1M each dATP and dGTP, 50 uM ATP-y-S, 4 pmol 
DnaBg, 2.5 pmol B, and 0.5 pmol Pol III*. Reactions were incubated 5 min at 37 °C 
before immobilization on beads for 10 min at 25°C. Beads were washed three 
times in 500 ul replication buffer containing 30 uM dATP and dGTP, 12 pmol B, 
and 501M ATP-y-S before re-suspending in 10 ul replication buffer containing 
60 uM dCTP, 60 uM dGTP and 2.5 pmol f. Replication was initiated by addition 
of 10 p11 0.5 mM ATP, 200 nM primase (unless indicated otherwise), 480 nM SSB,, 
50 uM NTPs and either: 60 1M dATP, 10M TTP, 3 pCi [x-*’P]TTP to label 
the lagging-strand template (that is, leading-strand product is the lagging-strand 
template in a rolling circle reaction), or 604M dTTP, 10 pM dATP and 3 pCi 
[x-**P]dATP to label lagging-strand Okazaki fragments. After 15 s, 5 pil replication 
buffer containing 20 nM Trap DNA was added to sequester any polymerase that 
dissociates during replication, thereby preventing it from filling gaps produced by 
signal release. Indicated concentrations reflect final reaction conditions in 25 pl. 
Replication was allowed to proceed for 35 s. The Trap DNA consists of M13mp18 
ssDNA primed with a 3’ dideoxyterminated DNA 30-base oligonucleotide (5'- 
GTTAAAGGCCGCTTTTGCGGGATCGTCACddC-3’) onto which the B clamp 
was pre-assembled in a 2-min reaction at 37 °C containing 4.8 pmol SSB,, 2.5 pmol 
f>, 1 pmol y-complex. Reactions were quenched by adding 25 il of 40 mM EDTA 
and 1% SDS. Where indicated, primase was substituted with 20-base DNA oligo- 
nucleotides (5’-GCAAAAGCAGGCACAGCAAC-3’). Beads were washed twice 
in Buffer E (20 mM Tris-HCI (pH 7.5), 4% glycerol, 0.1 mM EDTA, 40 1g ml 
BSA, 5mM DTT) and once in S1 buffer (200 mM sodium acetate (pH 4.5), 1.5 M 
NaCl and 10 mM ZnSO,). 

$1 endonuclease digestion. Beads were re-suspended in 20 pl S1 buffer and 5 pl of 
the reaction was withdrawn and incubated with 1U S1 nuclease (Fermentas) 
for 7.5 min at 37°C in 15 pl total reaction volume. Reactions were quenched by 
adding 15 pl of 40 mM HEPES (pH 7.5), 40 mM EDTA and 6% SDS. DNA was 
released from beads by incubating at 95 °C for 3 min and resolved on 0.8% alkaline 
agarose gels. 

Control reactions for $1 assay. Where indicated, $1 treatment was performed on 
either 0.2 tg supercoiled pIK31 plasmid DNA, pIK31 linearized with PstI (pIK31- 
PstI), pIK31 nicked on one strand with Nb.BvCI (pIK31-BvCI) or 0.4 tug circular 
M13mp18 ssDNA. Rolling circle replication reactions were performed as des- 
cribed above. Reactions that were treated with Pol II were first replicated and 
quenched as described above, washed three times in replication buffer before re- 
suspending in 25,1 buffer containing 20nM Pol IJ, 604M dCTP and dGTP, 
2.5pmol 2, 1 pmol y-complex and 0.5mM ATP and incubated for 5 min at 
37 °C. Replication was initiated by adding 60 1M dTTP, dATP and incubated for 
10 min at 37°C. Replication for fill-in reactions with Pol II to label gaps was per- 
formed similarly and initiated with dATP, dTTP and (o-**P]dATP, where indicated. 
Ligation of Okazaki fragments. Replication reactions that were treated with T4 
DNA ligase (New England Biolabs) were first replicated in the presence of 
[a-**P]dATP and 800nM 5’-phosphorylated oligonucleotide (5'-P-GCAAAA 
GCAGGCACAGCAAC-3’) for lagging-strand synthesis and quenched as 
described. Beads were washed three times in 20mM Tris-HCl (pH 7.5), 4% 
glycerol, 0.1mM EDTA, 40.gml~’ BSA and 5mM DTT. Beads were re-sus- 
pended in T4 quick ligase buffer (NEB) and 1 ul of T4 DNA quick ligase for 
15min at 25°C. Reactions were quenched by adding 15 ul of 40mM HEPES 
(pH 7.5), 40mM EDTA and 6% SDS. DNA was released from the beads by 
incubating at 95 °C for 3 min and resolved in 0.8% alkaline agarose gels. 
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Analysis of lagging-strand polymerase processivity on primed )X174 ssDNA 
in trans. Replisomes were assembled on the biotinylated rolling circle substrate 
as described above. After the wash steps, beads were re-suspended in 20 ll rep- 
lication buffer containing 8 nM X174 ssDNA (5.4kb) primed with a DNA oli- 
gonucleotide (5'-CAAGCAGTAGTAATTCCTGCTTTATCAAG-3’), 2.4 pmol 
SSB,, 1 pmol f, 0.08 pmol y-complex, 60 4M dCTP and 60 1M dGTP for 2 min 
at 37°C. Beads were immobilized, washed and replication was initiated as 
described above. 

Data analysis. Lanes in alkaline gels were scanned using a phosphorimager with 
ImageQuant software. The intensity of radioactivity at each pixel was normalized 
to the corresponding molecular mass to correct for the fact that longer products 
incorporate more radiolabel. Size markers were used as a reference in the gel. 
Percentage signal release was calculated from the size difference of S1-treated 
leading- and lagging-strand products (fit with a single Gaussian distribution as 
described in the legend to Supplementary Fig. 3) using the following calculation: 


Size (1g) 


——_-— x 100 
Size (d+S1) * 


SR(%) = 
where SR = signal release; size (1g) = size of lagging-strand product; size (Id + 
S1) = size of S1 digested leading-strand product (that is, lagging-strand template). 
Single-molecule replication assays. Replisomes were assembled onto 530 fmol 
(10.5nM) 5’ biotinylated 100-base oligonucleotide rolling circle DNA by first 
incubating with 18 pmol DnaB helicase (365nM) in 50 ul Buffer A (20mM 
Tris-HCl, pH 7.5, 5mM DTT, 40 ug ml! BSA, 4% glycerol) containing 8 mM 
MgOAc and 0.25 mM ATP, followed by incubation for 30 s at 37 °C. Then a 25 pl 
solution of Buffer A containing Pol III* (675 fmol («e0)t366’ Wy, 27 nM as com- 
plex), B2 (1.85 pmol, 74nM as dimer), 60 1M each dCTP and dGTP and 8mM 
MgOAc, was added followed by a further 1 min at 37°C. To immobilize the 
replisome-DNA complex in the flow cell, 1 pl of the above reaction was diluted 
1000-fold into 1 ml of Buffer B (8 mM MgOAc>, 60 1M each of dCTP and dGTP, 
and 50 nM Yo-Prol in Buffer A), then the diluted reaction was passed through the 
flow cell at 500 ul min’ for 30s, and a further 2 min at 10 ul min” '. DNA rep- 
lication was initiated upon flowing Buffer A containing 60 1M of each deoxyri- 
bonucleotide triphosphate, 250 uM each of CTP, TTP and UTP, 1mM ATP, 
462nM SSB,, 100nM primase, 50nM fy, 50mM potassium glutamate, 50nM 
Yo-Prol, 0.8% glucose, 0.01% B-mercaptoethanol, 0.57 U glucose oxidase and 
2.1 U catalase. The flow rate was either 10 jl min ' or 100 jl min", as indicated. 
After 10 min of replication, the flow rate was adjusted to 100 pl min“! to analyse 
replication products. 
Intensity analysis of the DNA products. To correct for flow disturbances in our 
analysis, we averaged between 30 and 40 frames for each DNA strand (over two 
adjacent pixels); the same operation was performed for an adjacent empty region 
to serve as background correction that took into consideration the signal variation 
of non-uniform intensities shown by the bilayer. In addition to this background 
subtraction, we performed a signal normalization. As the DNA strand grows 
longer and extends farther away from the biotin-streptavidin anchoring point 
in the lipid bilayer, there is an additional ‘slow-drift’ in intensity that is due to 
higher strand mobility into the flow. Therefore we normalized the slow-drift 
intensity signal to 100% dsDNA by applying a rank-order filter, a method best 
suited for removing shot noise. This method finds the specified percentile of data 
points in the data window around each point in the data set (here the corrected line 
plot intensity) and replaces that point with the percentile. We analysed the data 
using Matlab software. 
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Structures of protein-protein complexes involved in 


electron transfer 


Svetlana V. Antonyuk'*, Cong Han'*, Robert R. Eady! & S. Samar Hasnain! 


Electron transfer reactions are essential for life because they under- 
pin oxidative phosphorylation and photosynthesis, processes lead- 
ing to the generation of ATP, and are involved in many reactions of 
intermediary metabolism’. Key to these roles is the formation of 
transient inter-protein electron transfer complexes. The structural 
basis for the control of specificity between partner proteins is lack- 
ing because these weak transient complexes have remained largely 
intractable for crystallographic studies**. Inter-protein electron 
transfer processes are central to all of the key steps of denitrifica- 
tion, an alternative form of respiration in which bacteria reduce 
nitrate or nitrite to N, through the gaseous intermediates nitric 
oxide (NO) and nitrous oxide (NO) when oxygen concentrations 
are limiting. The one-electron reduction of nitrite to NO, a pre- 
cursor to N20, is performed by either a haem- or copper-containing 
nitrite reductase (CuNiR) where they receive an electron from redox 
partner proteins a cupredoxin or a c-type cytochrome**. Here we 
report the structures of the newly characterized three-domain 
haem-c-Cu nitrite reductase from Ralstonia pickettii (RpNiR) at 
1.01 A resolution and its M92A and P93A mutants. Very high reso- 
lution provides the first view of the atomic detail of the interface 
between the core trimeric cupredoxin structure of CuNiR and the 
tethered cytochrome c domain that allows the enzyme to function 
as an effective self-electron transfer system where the donor and 
acceptor proteins are fused together by genomic acquisition for 
functional advantage. Comparison of RpNiR with the binary com- 
plex of a CuNiR with a donor protein, AxNiR-cytc551 (ref. 6), and 
mutagenesis studies provide direct evidence for the importance of a 
hydrogen-bonded water at the interface in electron transfer. The 
structure also provides an explanation for the preferential binding 
of nitrite to the reduced copper ion at the active site in RpNiR, in 
contrast to other CuNiRs where reductive inactivation occurs, pre- 
venting substrate binding. 

Conversion of nitrate to N>, known as the denitrification process, is 
performed by several distinct enzymes”. The first committed step, 
NO, +2H' +e — NO+H,O is catalysed by NiR. The well- 
studied two-domain CuNiRs are trimers of 106 kDa, with each mono- 
mer having two domains with a characteristic B-sandwich cupredoxin 
motif. Each monomer has a type-1 Cu (T1Cu) centre with (Cys—-Met- 
His.) ligation and a type-2 Cu (T2Cu) site with (His;—-H2O) ligation at 
the interface between two monomers, both of which provide its 
ligands. During catalysis nitrite binds to the T2Cu, displacing the 
liganded HO. Electrons produced by respiration are transferred to 
the T1Cu site by a partner redox protein. Subsequently electrons are 
transferred by a 12.6A internal His—Cys bridge in a proton-gated 
reaction, to the T2Cu site, where nitrite is reduced to NO”. The 
timing of electron and substrate delivery to the active site of CuNiRs 
is critical to catalysis, because if the water coordinated at the T2Cu site 
of the oxidized enzyme dissociates before nitrite binds to displace this 
water, the enzyme is inactivated’**, 

Recently, two new sub-classes of CuNiRs have been recognized that 
retain the core structure of two-domain enzymes but contain an extra 


cupredoxin or haem-c-containing domain, fused at the amino (N)- and 
carboxy (C)-terminals respectively’*'®. Genome analysis has shown 
that these variant enzymes are quite widely distributed in Nature. The 
cupredoxin N-terminal-fused three-domain NiR from Hyphomicro- 
bium denitrificans (HdNiR) has been biochemically and structurally 
defined'”"*. Surprisingly the observed hexameric structure, against all 
expectations, showed that the Cu site in the extra cupredoxin domain 
was located too far away from the catalytic core for effective electron 
transfer’”. 

The first biochemical and functional characterization of a fused 
three-domain cytochrome c CuNiR, the enzyme from R. pickettii 
(RpNiR)”’, showed it to be trimeric, similar to two-domain CuNiRs 
such as AxNiR. However, RpNiR showed a major difference in that the 
oxidized resting state had a significantly lower nitrite binding affinity. 
Our structure of RpNiR at 1.01 A resolution reported here (Fig. 1) 
provides the structural basis of the functional importance of the haem 
c in RpNiR, where electron transfer from the reduced haem occurs 
during turnover. It also provides an explanation as to how the coordi- 
nated water at the T2Cu site is stabilized, preventing its displacement 
by nitrite, rationalizing the low binding affinity of nitrite to the oxidized 
enzyme. A comparison of the RpNiR structure with the cupredoxin 
N-terminal fused NiR, HdNiR, shows a remarkable conservation of 
the three-dimensional structural organization for these two new classes 
of extended three-domain NiRs (Supplementary Fig. 1). T1Cu of 
the cupredoxin domain of HdNiR can be brought to an appropriate 
distance for electron transfer to the catalytic T1Cu-T2Cu core by a 
simple rotation of the cupredoxin domain. 

Potential electron transfer routes between cupredoxin and haem c 
domains have been clarified to some extent by structural studies of 
several other electron transfer complexes: the transient binary AxNiR- 
Cytcss; complex (1.7 A resolution)*® and the integral membrane com- 
plex caa3-type cytochrome c oxidase (2.36 A resolution)”, for example. 
Our structure of RpNiR clearly defines electron transfer routes 
between a haem c domain and a catalytic cupredoxin core. The fold 
of the trimeric CuNiR core domain is preserved and the haem c 
domain of one monomer is in close proximity to the T1Cu site of 
another monomer at an electron transfer compatible distance of 
10.1 A between the T1Cu ion and the closest haem edge (CBC methyl 
carbon) of the porphyrin (Fig. 2). The linker between haem c and core 
NiR domain forms ten salt bridges, eleven hydrogen bonds with the 
core and nine hydrogen bonds with the haem domain contributing to 
the stability of the haem c domain. These interactions would preclude 
any significant movement of the haem domain from its position that 
favours electron transfer to the T1Cu centre. Comparison of RpNiR 
with the structure of the related but biochemically uncharacterized 
enzyme from Pseudoalteromonas haloplanktis (PhNiR) (Protein 
Data Bank entry 2z00) showed differences in the overall organization 
of the subunits. The RpNiR monomer is more compact, the linker 
between haem and Cu binding domains (residues 315-333) is arranged 
in a helical structure and only two monomers are involved in forming 
the T2Cu site and the interface between the haem and cupredoxin 
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Figure 1 | Structural organization of RpNiR. a, b, Trimer viewed along 

(a) and perpendicular to (b) the three-fold axis. T2Cu (darker blue sphere) 
binds at the interface between two monomers (a, b). T1Cu (lighter blue sphere) 
is located closer to the surface of each monomer in the proximity of the 
cytochrome domain from the adjacent monomer at a distance commensurate 
with efficient electron transfer. c, Ribbon diagram of a monomer with 


domains. In contrast, in PhNiR the linker wraps around the neighbour- 
ing monomer and reaches the distant third monomer, creating the 
cytochrome and catalytic domain interface (Supplementary Fig. 1). 
The significance of these differences awaits the biochemical characteri- 
zation of PhNiR. 


* 


1 3 Water* 


oe 


Figure 2 | Details of interaction between cytochrome and Cu binding 
domains. a, RpNiR. An electron transfer route can be traced from the 
porphyrin CBC atom by a water molecule, the Ne2 atom of the His 143 ligand 
of the T1Cu centre. For the RpNiR, a 2F, — F, map contoured at 1.0¢ for the 
T1Cuand T2Cu sites is shown. Fe-T1Cu distance is 10.4 A, T1-T2Cu is 12.7 A. 
For simplicity, only one conserved water molecule on the shortest route to 
T1Cu is shown at the interface between the cytochrome domain and T1Cu. 
This water is located in very close proximity both to the CBC haem atom (3.6 A) 
and the Ne2 atom of the Cu-ligating His 143 (2.8 A). b, AxNIR-Cytes5) 
complex® with shortest distances between haem and Cu binding domain shown 
as dotted orange lines. The binary complex, AxNIR-Cytcs5), has a hydrophobic 
interface between cytochrome css, and the T1Cu site. 
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cytochrome domain (orange-red), cupredoxin domains (green—blue) 
containing T1Cu and T2Cu and the helical linker structure formed by residues 
315-333 (yellow) between the two (c) and details of the water (red) channel 
linking it to W1 coordinating to T2Cu (d). The stability of the haem c domain in 
RpNiR is also reinforced by the extended interface area of 2267 A” between the 
two monomers in the trimer, of which 452 A? involves the haem c domain. 


A striking feature of the RpNiR structure is the extensive water 
network at the interface of the haem c domain and the surface above 
the T1Cu of the core of the enzyme (Figs 2 and 3a). Several computa- 
tional studies have identified the importance of water molecules hydro- 
gen bonded between the donor and acceptor sites in providing superior 
electronic coupling. This results in substantially enhanced rates of 
electron transfer, as between methylamine dehydrogenase and the 
cupredoxin amicyanin”’. In the case of RpNiR, the interactions between 
the cytochrome and the catalytic-core domains are either C-C interac- 
tions of haem with Met 92 and Pro 93 or are mediated by water hydro- 
gen bonds between the two domains. Similar interactions are seen in 
PhNiR (Supplementary Fig. 2). In contrast to these hydrated interfaces, 
the structure of the transient binary AxNiR-Cytc;;, complex shows 
that the protein-protein interface is a primarily hydrophobic patch 
with a non-polar core sealed off from the aqueous environment®, with 
no water molecules that may be involved in electron transfer. In this 
complex, and in native AxNiR, His 135 (His143 in RpNiR) is protected 
from hydrophilic environment by Met 135 and Trp 138. A similar 
situation is found in the tethered membrane complex caa3-type cyto- 
chrome c oxidase of Thermus thermophilus, where the electron transfer 
pathway from haem c to the Cu, centre does not involve water, but 
rather the D-pyrrole and D-propionate of haem c to the cis amide N and 
a C of Phe 126 of the cupredoxin domain”. 

The potential involvement of Met 92 and Pro 93 in electron transfer 
of RpNiR was tested by individual substitution by Ala. Structures of 
these variants determined at 1.9 and 14A respectively show that the 
conserved water was not perturbed by these substitutions (Fig. 3). 
The mutations had no significant effect on the specific activity of the 
enzyme; and in single turnover experiments where reduced RpNiR was 
reoxidized by nitrite, the electron transfer efficiency from haem to 
T2Cu was similar to the wild-type enzyme, with M92A variant show- 
ing a small decrease in rate (see Supplementary Information). These 
small effects suggest that the conserved water molecule hydrogen 
bonded to the solvent-exposed T1Cu histidine ligand and that the 
carbonyl of the peptide bond of Ala138 plays a dominant role in 
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Figure 3 | Comparison of interactions between cytochrome and Cu binding 
domains in wild type RpNiR, P93A and M92A mutants. a-c, RpNiR 

(a), P93A (b) and M92A (c) in H3 crystal form. A 2F, — F, map contoured at 
1.0¢ at 1.01/1.4/1.8 A resolution for wild type/P93A/M92A. For all three 
structures, the distances between Fe and T1 Cu is 10.1-10.3 A. The interface 
between the cytochrome domain and the T1Cu site has two conserved water 


electron transfer in this tethered complex. The nature of this inter- 
action precludes further testing of its involvement in electron transfer 
by mutation. This role for hydrogen-bonded water in electron transfer 
between the haem and T1Cu site contrasts with the transient binary 
AxNiR-Cytcss; complex, where the close contact between the two 
proteins results in use of C-C interactions between the CBC methyl 
group and Pro 88 of the cupredoxin domain of the core NiR. These two 
electron transfer systems illustrate the two classes of predicted electron 
tunnelling processes: protein mediated and structured water mediated”. 

The structure also provides insights into the mechanism of this 
recently characterized class of CuNiRs. A surprising feature of the 
RpNiR structure was that, despite the similarity to two domain NiRs, 
the usual substrate access channel is closed. Atomic-resolution struc- 
tures of two-domain NiRs have shown the detail of two proton path- 
ways to the active site and the hydrophobic substrate access channel to 
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Figure 4 | Details of the proton pathway in RpNiR and its comparison with 
PhANiR (Protein Data Bank entry 2z00). RpNiR is shown in dark blue 

with water as red sphere, PANiR in light blue. This channel comprises residues 
on the interface between two monomers that extends from the T2Cu catalytic 
centre to the protein surface and is filled with an extensive water network. 

In the case of PhNiR, the channel is interrupted by Ile 235, which has a 
different orientation to the corresponding Ile 245 of RpNiR. Ile 235/245 in 
these extended NiRs correspond to the conserved His 254 residue in the 
two-domain AxNiR”. 
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molecules (W1 and W2) located in very close proximity to the CBC haem atom 
(3.5-3.6 and 3.3-3.4 A respectively); the waters are hydrogen bonded to each 
other. The distances from the W1 molecule to His 143(Né2) and Ala 138(O) are 
approximately 2.8 A in three structures. Extra water W3 is seen in P93A close to 
the mutation site. 


the protein surface*’. Mutagenesis studies of AxNiR established only 
one proton pathway to be functionally relevant, namely the highly 
ordered (Asp 92-water—water-Ala 131-Asn 90-Asn 107) network from 
the T2Cu site (Supplementary Fig. 3) to the protein surface”. This 
hydrophobic channel, approximately 6 A wide formed by residues 
belonging to adjacent monomers, was also assigned as the route of 
substrate access to the catalytic centre. In RpNiR this channel extends 
towards the haem domain, but lacks water molecules as it is blocked by 
residue Tyr 323 that forms part of the linker between cytochrome and 
cupredoxin domains. Instead, the non-functional proton pathway of 
two domain NiRs predominates (Fig. 4 and Supplementary Fig. 1d). 
This channel is present in the structure of PHNiR also. However, in 
PHNiR, Ile 235 adopts a different orientation to Ile 245 of RpNiR and 
interrupts it. This residue may thus be a channel activator and the two 
structures may represent two alternative protein states. 

In contrast to other CuNiRs, the oxidized T2Cu catalytic sites of 
RpNiR have a very low affinity for nitrite’. This does not arise from 
constraints imposed by the altered substrate access channel described 
above because the apparent Michaelis constant for nitrite is 26 uM, 
comparable to other CuNiRs. Structural and kinetic studies of CuNiRs 
indicate that reduction of the T2Cu can result in the loss of the coor- 
dinated H,O ligand to form a catalytically inactive Cu* (His); coordi- 
nation at the active site’**. The water ligated to the T2Cu of NiR in all 
other structures hydrogen bonds to the carboxylate group of the active 
pocket Asp at 2.54-2.83 A and a histidyl nitrogen atom****, In RpNiR 
the coordinated water W1 has two extra hydrogen bonds (Supplemen- 
tary Fig. 4), being connected to a second water molecule (labelled W2) 
as well as O61 of Asp 97 that is also hydrogen bonded to the OH of 
Tyr 323. Water W2 also has a hydrogen bond to the OH of Tyr 323 that 
blocks the putative substrate entry channel identified in two-domain 
NiRs. This would be expected to provide more stability to the coordi- 
nated water molecule, hindering nitrite binding to the oxidized site but 
stabilizing it on reduction to allow nitrite to bind by displacement of 
the water ligand before the catalytically inactive Cu" (His); coordina- 
tion species is formed. 


METHODS SUMMARY 


RpNiR was expressed in Escherichia coli BL21(DE3) and purified as previously 
described’’. For M92A and P92A, site-directed mutagenesis was performed 
following the QuikChange site-directed mutagenesis protocol (Agilent), using 
KOD Hot Start DNA polymerase (Merck). The plasmid pET26b-RpNiR was used 
as template for introducing the following single amino-acid changes in RpNiR by 
site-directed mutagenesis: M92A and P93A. The primers for the M92A mutant 
were (sense) 5'-CACCCGAGCAGCAAGGCGCCGCACAACATCGA-3’ and 
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(antisense) 5'-TCGATGTTGTGCGGCGCCTTGCTGCTCGGGTG-3’. The pri- 
mers for the P93A mutant were (sense) 5’-CGAGCAGCAAGATGGCGCAC 
AACATCGAC-3’ and (antisense) 5’-GICGATGTTGTGCGCCATCTTGCTG 
CTCG-3’. The mutant plasmids were confirmed by DNA sequencing before 
transformation into E. coli strain BL21(DE3). Both mutants were purified follow- 
ing native protein protocol. Crystals for RpNiR and its mutants (H3 crystal form) 
were grown over a month using the hanging drop vapour diffusion method 
at 4°C. Wild-type RpNiR P2,3 crystal form grew in 4 days. Diffraction data were 
collected from the single crystal at 100K using a PILATUS-6M detector at 
Proximal (Soleil), at ID24 and 102 (Diamond). For native protein crystals 
(RpNiR-H3/RpNiR-P2,3), four data sets were collected at 0.98, 1.22, 1.33 and 
1.7A X-ray wavelengths to confirm correct metal incorporation. All data were 
processed and scaled by HKL2000 software’>. Only data sets collected at the 0.98 A 
wavelength were used for structural determination and refinement. Native RpNiR- 
H3 structure was solved by molecular replacement using MOLREP” in the CCP4 
(ref. 27) program suite and PhNiR (Protein Data Bank entry 2zo00) as the search 
model. H3 and P2,3 forms have one and four monomers, respectively, in an 
asymmetric unit. RpNiR-P2,3 was solved by molecular replacement using an 
RpNiR-H3 model. All structures were refined using REFMACS (ref. 28) in the 
CCP4 program suite. Rebuilding of the model between refinement cycles and 
adding waters was performed in Coot”. Data collection and refinement statistics 
are summarized in Supplementary Table 1. 


Full Methods and any associated references are available in the online version of 
the paper. 
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METHODS 


RpNiR was purified by DEAE ion-exchange chromatography and size-exclusion 
chromatography. The size-exclusion chromatography buffer was 20 mM Tris- 
HCl, pH 8.0, 200mM NaCl. The RpNiR protein was dialysed against size- 
exclusion chromatography buffer plus 0.1 mM CuSO, for reconstitution of type 
2 copper sites. The fraction was then dialysed exhaustively against three changes of 
20 mM Tris-HCl, pH 7.5, 200 mM NaCl with a minimum equilibration period of 
5 h between buffer changes. The protein samples were pooled and concentrated up 
to 15mgml!. 

Crystals for RpNiR and its mutants (H3 crystal form) were grown using the 
hanging-drop vapour-diffusion method at 4 °C temperature from similar crystal- 
lization conditions: 2 ml of protein solution in 20 mM Tris-HCl, 200 mM NaCl 
(pH 8.0) was mixed with an equal volume of reservoir solution containing 20% 
PEG3350, 0.2 M sodium citrate. The buffer for reservoir solution was 20 mM Tris- 
HCl, pH 7.5 for wild-type protein and 20 mM MES pH 6.5 for the mutants. Protein 
concentration was approximately 15 mg ml! H3 crystal form. Wild-type RpNiR 
P2,3 crystal form was grown from 20% PEG3350, 0.2 sodium citrate, 200 mM 
NaCl and protein concentration 7.5 mg ml '. All crystals were flashed cooled in 
the mother liquor plus 10% glycerol solution. 


LETTER 


For RpNiR-H3 and P93A-H3 structures, anisotropic temperature factors were 
refined and riding hydrogen atoms were added to the model. The lower-resolution 
M93A-H3 structure was refined with isotropic B-factors and riding hydrogen 
atoms in the calculated positions. No restrains were used for Cu atoms refinement. 
The quality of the models was assessed using Coot”, Procheck*® and the 
Molprobity” server. Comparison of the Co atom positions in all RpNiR structures 
(RpNiR-H3, RpNiR-P2,3) showed average root mean squared deviation between 
the Cx atoms of approximately 0.2 A, with higher flexibility in the N terminus 
(residues 1-16) and part of cytochrome domain (residues 383-393) atoms. The 
first three residues of the N terminus and the last six residues of the C terminus 
were not visible in the electron density map. Within five monomers in two struc- 
tures (RPNiR-H3, RpNiR-P2)3), the position of the cytochrome domain was 
identical and crystal contacts independent. Analyses of anomalous signal in the 
Cu and Fe sites confirmed correct Cu and Fe incorporation. 
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CAREERS 


Alab retreat offers a chance to 
reinvigorate a research programme p.129 


The latest discussions and 
news on research jobs go.nature.com/z8g4a7 


For the latest career 
listings and advice www.naturejobs.com 


Long non-coding RNA (red) regulates DNA and proteins in the nuclei of cells. 


R 


he genome’s rising stars 


As the once -fringe field of long non-coding RNA moves into the limelight, young 
researchers could reap the benefits. 


BY AMY MAXMEN 


hen Saba Valadkhan lingered in the 
hallways at conferences, absorbed 
in discussions about the strands of 


junk’ DNA that litter the human genome, she 
was not looking for work. She was consumed. 
with curiosity about the possibility that long 
RNA sequences that do not encode proteins 
nevertheless have a function — enhancing 
or suppressing gene expression. Valadkhan’s 
enthusiasm about the budding field of long 
non-coding RNA (IncRNA) did not go unno- 
ticed: senior investigators were on the hunt for 
young researchers willing to pursue the topic. 
“Before I was even looking for job opportuni- 
ties, I was told about people who were hiring,” 


says Valadkhan. Soon after receiving her PhD 
for studies on small nuclear RNA — a type of 
non-coding RNA — at Columbia University 
in New York in 2003, she took a position as 
an assistant professor at Case Western Reserve 
University in Cleveland, Ohio. 

John Rinn, now a molecular biologist at 
Harvard University and the Broad Institute of 
MIT and Harvard in Cambridge, Massachu- 
setts, also had a rapid career launch; as a post- 
doc at Stanford University in California, he 
was noticed at meetings where he spoke about 
his research on how IncRNA silences genes 
involved in embryonic development. Rinn was 
offered several faculty positions, but was sold 
on the Cambridge post when Stuart Schreiber, 
a chemical biologist and a founding member 
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of the Broad Institute, told him: “Every day I 
come to work dreaming of how I will bend the 
genome to my will” Rinn wanted to bend the 
genome with IncRNA to learn how to prevent 
and cure diseases. 

At the time, modifying gene expression using 
IncRNA was not a common goal. In the early 
2000s, most molecular biologists were inter- 
ested in the 1-2% of the human genome that 
encodes proteins, which were presumed to be 
the brokers of biological functions. But with the 
rise of high-throughput sequencing, researchers 
learned that far from being without function, 
much of the rest of the genome was transcribed 
into non-coding RNA, including 20-nucleotide 
microRNAs, which suppress genes, and Inc- 
RNAs of 100 nucleotides or more. 
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> — Last September, the multi-institution 
ENCODE project to catalogue human DNA 
elements (see Nature 489, 46-48; 2012) revealed 
that three-quarters of the human genome is 
transcribed into non-coding RNA, and that 
there may be between 10,000 and 200,000 Inc- 
RNAs. Scientists have shown that these can acti- 
vate gene expression and silence genes, and links 
with disease have begun to emerge. 
Enthusiasm for IncRNA has replaced much 
of the science community's scepticism. Molec- 
ular biology and biochemistry departments 
have taken note ofa flurry of high-impact man- 
uscripts, and some are hiring scientists to work 


in the emerging field. 

Funding is becoming “You start 

easier tofind. Andthe Prajects and 

first biotechnology YOu have no idea 

companies focused what to expect. 

on IncRNA have It’s that kind of 

taken root. challenge that I 
Scientistshopingto enjoy very much 

work in the field will as ascientist.” 


benefit from experi- 
ence with model organisms and cell cultures, or 
from a background in bioinformatics. But they 
also need traits such as creativity, a willingness 
to collaborate and the confidence to weather 
disappointment — in emerging fields, no exper- 
iments are straightforward and foolproof. For 
some researchers, however, these risks are part 
of the allure. “We are part of a very fast-moving 
field,” says Valadkhan. “You start projects and 
you have no idea what to expect. It’s that kind of 
challenge that I enjoy very much asa scientist.” 
Despite the wave of enthusiasm, IncRNA 
studies may still be difficult to launch. “RNA 
has really been given the shaft,” says Tom 
Cech, a chemist at the University of Colorado 
Boulder who shared the 1989 Nobel Prize in 
Chemistry for discovering that RNA can cata- 
lyse reactions. “People were reluctant to hear 
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that RNA could do something other than lead 
to proteins back in the 1970s. But they are still 
surprised to hear it now” 

There are no optimized experimental pro- 
tocols and few clues to the habits of individual 
IncRNAs, so experiments often fail. And when 
they do work, investigators need to go the extra 
mile to convince reviewers that their results are 
real. “We had a difficult time publishing our 
studies for the first couple of years I was at Case 
Western,’ says Valadkhan. “It’s so novel that, 
understandably, people are very sceptical.” 

“T have had so many conversations where 
people think I’m just full of crap,” says Kevin 
Morris, a molecular biologist at the Scripps 
Research Institute in San Diego, California, 
who studies IncRNAs involved in HIV and 
cancer. “You need a thick skin to be in this 
field. You need to do it because you love it” 


LOOKING UP 
Nevertheless, more grants are being funded 
for IncRNA projects. At the US National 
Institutes of Health (NIH) in Bethesda, Mary- 
land, 28 applications on the subject have been 
approved since the start of 2012. By contrast, 
just six were funded between 2008 and 2011. 
Specific calls for ncRNA studies are also 
becoming more common. A 2012 funding 
announcement from the NIH’s National Insti- 
tute on Drug Abuse lists studies on interac- 
tions between IncRNAs and chromatin — the 
cluster of DNA and proteins in a cell’s nucleus 
— asa high scientific priority. A programme 
announcement from the National Cancer 
Institute (NCI) calls for research on IncRNAs 
as biomarkers for cancer. Investigators can also 
submit projects that use IncRNA as a tool to 
approach other questions. For example, Kevin 
Howcroft, a programme director at the NCI, 
notes that people studying prostate cancer 
who might once have looked at proteins in a 
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signalling pathway might now propose to ana- 
lyse IncRNA. 

There are other funding niches. Valadkhan's 
work on neuronal differentiation is funded 
by the ALS Therapy Alliance, a consortium 
based in Needham, Massachusetts, that raises 
money for research related to the neurodegen- 
erative disease amyotrophic lateral sclerosis. 
Increasing the amount ofa single IncRNA ina 
fibroblast, a common cell in connective tissue, 
can transform it into a neuron, so research on 
IncRNA could pave the way for treatments to 
reverse neuronal degeneration, says Valadkhan. 
The California Institute for Regenerative Medi- 
cine in San Francisco, which funds stem-cell 
research, has also awarded grants for research 
into how IncRNAs can change the fate of cells. 


COMMERCIAL INVESTMENT 

The private sector has also begun to notice 
IncRNA. In 2011, start-up company RaNA 
Therapeutics in Cambridge, Massachu- 
setts, received $20.7 million from investors 
including the agriculture biotechnology firm 
Monsanto in St Louis, Missouri. 

RaNAs goal is to manipulate IncRNA to con- 
trol genes such as those that suppress tumours 
or cause cancer. Individual IncRNAs have 
specific targets, so treatments based on them 
might well have more predictable effects than 
those based on other RNAs. And they offer a 
rare chance to activate, rather than just block, 
gene expression. 

In 2012, the company grew from 2 employees 
to 26. President and co-founder Art Krieg says 
that he plans to expand again in 2014 — and that 
he will put a premium on hiring people adept at 
teamwork. At the moment, RaNAs staff scien- 
tists are using high-throughput sequencing and 
a technique called chromatin immunoprecipi- 
tation to learn how IncRNA binds to proteins. 

At least two other biotechnology companies 
focus on ncRNA. OPKO-CURNA, a subsidiary 
of OPKO Health in Miami, Florida, is investigat- 
ing how to activate gene expression, for eventual 
use in treatments. And TransSINE Technologies 
in Yokohama, Japan, aims to produce IncRNA- 
based technologies to express antibodies and 
other proteins for laboratory assays. 

Career success for scientists in the field 
requires knowledge of genomics and molecular 
biology. Some experiments also call for experi- 
ence with model systems and cell culture, but 
Valadkhan, who has a background in biochem- 
istry, learned many techniques on the fly. 

Computational biology is important because 
it allows researchers to grapple with large data 
sets. Institutions including the Wellcome Trust 
Sanger Institute in Hinxton, UK, and the Broad 
Institute offer classes and workshops on bio- 
informatics; these typically range in length 
from days to weeks. Some workshops require 
familiarity with operating systems such as 
Unix and Linux and computing languages such 
as R, but students can learn to use such tools 
through books and online tutorials. 


Universities with financially well- 
endowed genome centres provide easy 
access to sequencing and computer-savvy 
scientists and technicians. The leading US 
institutions for IncRNA research include 
Harvard, the University of Wisconsin- 
Madison, Stanford, the University of Colo- 
rado Boulder and Yale University in New 
Haven, Connecticut. In Europe, institutions 
including the University of Vienna and 
the nearby Research Center for Molecu- 
lar Medicine of the Austrian Academy of 
Sciences have histories of RNA research, 
including IncRNA. 

The Center for Life Science Technologies 
at the RIKEN Yokohama Institute, which 
opened this month, will focus in part on 
IncRNA. “We can develop technology for 
several years without the pressure of writ- 
ing grants and publications because we 
have an institutional budget,” says Piero 
Carninci, the first director of the centre’s 
genomic-technology division, which will 
employ roughly 100 scientists, including 
9 principal investigators. 

Researchers investigating the role of 
IncRNAs in disease often collaborate 
with clinicians. Just such a partnership 
helped Claes Wahlestedt, now director 
of the Center for Therapeutic Innovation 
at the University of Miami Miller School 
of Medicine in Florida, to discover that a 
non-coding RNA drives the expression of 
an enzyme involved in the progression of 
Alzheimer’s disease (M. A. Faghihi et al. 
Nature Med. 14, 723-730; 2008). 

When choosing a lab, researchers should 
remember that it can be an asset to have col- 
leagues with a range of backgrounds, says 
Florian Karreth, a postdoc at Harvard Med- 
ical School in Boston, Massachusetts. “You 
don’t want 20 people with a background in 
microRNA.” In his group, he says, “there 
are people with experience in apoptosis, 
leukaemia and DNA repair, and it’s great to 
learn from all of then”. The 20 postdocs and 
a handful of graduate students and techni- 
cians often confer when starting experi- 
ments, and help each other to learn. 

In the absence ofa rich body of literature, 
ideas are often exchanged at conferences on 
RNA, epigenetics and genomics. As Valad- 
khan discovered, these are also good places 
to find jobs: senior scientists who attend 
may be looking for young investigators with 
creative ideas. Human IncRNAs are yet to 
be catalogued, and everyone wants to know 
more about their role in disease. 

“Tm really looking for people who think 
originally and are very open,” says Carninci. 
“Tt’s a new field, and we know almost noth- 
ing. So it’s important to find people who 
always question the dogma of the day.” = 


Amy Maxmen is a freelance writer in 
New York. 


COLUMN 


Time to reflect 


A lab retreat provides a chance to rethink and advance 
the research programme, says Eleftherios Diamandis. 


tive and energized, and periods of suc- 

cessful discovery may be followed by 
stretches filled with little more than attempts 
to address knowledge gaps or promote transla- 
tion. These are essential processes, but innova- 
tion may suffer if such periods last too long. 

As director of a 25-person research lab (ten 
graduate students, five technologists, six post- 
docs, a research coordinator and affiliates), I 
know that focusing too much on the day-to- 
day business of e-mails, manuscripts and grant 
applications can delay the introduction of new 
techniques and ideas. One way to inject enthu- 
siasm and re-excite the team is through a lab 
retreat to discuss everyone’ projects and work. 

The lab director generally sets the agenda for 
a retreat, but he or she should consider involv- 
ing other lab members. Getting everyone on 
board is important: lab staff should understand 
that this is not an exercise devoted to identify- 
ing the ‘good’ and ‘bad’ projects or people, but 
is rather an open conversation about long-term 
planning to determine who needs help and 
how the director and other lab members might 
give it to them. Retreats can help to identify 
new strategies and areas where lab members 
are duplicating efforts on the same questions. 

The director should circulate instructions 
ahead of time, explaining what he or she 
expects to be covered in the presentations 
and discussion. Constructive criticism should 
dominate. Participants should understand that 
this is not merely an update on research pro- 
gress, but is instead a soul-searching exercise. 
Lab members should be asked to consider self- 
assessment questions such as ‘Am I innovating 
or imitating?; “Will my results lead to signifi- 
cant publications if successful? ‘Are there 
other techniques I should be using?’ and Am 
I being too risk averse?’ Each person should 
address the novelty of their project as opposed 
to simply ‘what they're working on. 

The lab supervisor might open the retreat 
with some remarks on each of the lab’s research 
focuses, addressing the self-assessment ques- 
tions as they pertain to the entire lab. Each 
member can then give a short presentation 
(perhaps five to seven slides) on his or her 
project, with special emphasis on innovations 
and anticipated research impacts. 

Questions and criticisms must go beyond 
those at a typical lab meeting, where most 
questions focus on specific experiments and 


| abs sometimes struggle to stay competi- 
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technical details. At the retreat, questions 
should examine appropriate approaches and 
even whether a project is worth pursuing at all. 
This may reveal that hypotheses or strategies 
have weaknesses that need correction — or that 
suggest that a project should be abandoned. 

To promote debate, the director should con- 
sider assigning one or two lab members the role 
of devil’s advocate. At my lab’s retreat last year, 
one postdoc presented his findings on the anti- 
cancer properties of cardiac glycosides — drugs 
used to treat heart failure and arrhythmia — in 
a model of pancreatic cancer. The opponents 
challenged him, noting that his effective drug 
concentrations were ten times higher than the 
safety limit in humans — it might be possible to 
treat the cancer, said one, but the patient would 
die of cardiac arrest. Discussion followed on 
ways of retaining the anticancer properties 
but avoiding the toxicity. Another critic sug- 
gested that it could be possible to find a drug 
that acts with the cardiac glycoside, enhancing 
its anticancer activity at safer doses. After the 
retreat, this idea led the postdoc to perform a 
high-throughput screen of a 10,000-molecule 
chemical library to find such an agent. He is now 
testing a drug combination in animal models. 

Lab directors are responsible for propos- 
ing and contemplating the direction that a lab 
will take in future. But retreats help to foster 
an annual re-examination of projects. Mem- 
bers need to recognize that the success of a lab 
depends on their capacity to innovate. In the 
end, all will share the rewards. m 


Eleftherios Diamandis is professor of 
laboratory medicine and pathobiology at the 
University of Toronto in Canada. 
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BY CONOR POWERS-SMITH 


hen Allison came in, Jennifer 
muted the TV, and asked, “Well?” 
Allison shrugged. Only when 


she was seated on the sofa, under Jennifer's 
smiling scrutiny, did Allison’s determinedly 
fixed expression give way to a wide grin. 

Jennifer's smile widened in sympathy with 
her roommate's. She prompted, “Good?” 

“Yeah? 

“Whatd you do?” 

“A movie. Walked around for a while.” 
They could’ve been picking up trash by the 
side of the road for all Allison cared. She 
heard herself blurt, “I really like him” 

“I seem to get that impression. But this 
was only date two, right?” 

Allison felt like shed known Kevin for 
years, her whole life, longer. Still, she had to 
nod. “But we're going out again tomorrow.” 

“So this is a thing now,” Jennifer said. 
“Officially.” 

“T guess. I hope” 

“So, don't you think it’s time we checked 
him out?” 

“Jen...” 

“He’ got a profile, hasn't he?” 

“Yeah. You pretty much have to now, or 
youre some kind of weirdo.” 

“So...2” 

“IT don't know” 

“Allie, it’s not being nosy. It's right there for 
everyone to see. It’s just being responsible. I 
mean, if this is going to be a thing, you owe 
it to yourself to go in with your eyes open.” 

“T guess. But —” 

“And admit it: you're just the tiniest bit 
curious.” 

Allison could hardly deny that. “Well... all 
right.” 


Jennifer sat in front of the computer, Allison 
behind her, on the edge of Jennifer’s bed, 
craning her neck to see the screen. Geno-Me 
was slick and inviting, as befitted one of the 
world’s most popular social media sites. The 
stylized double helices bordering each page 
glowed their soft greens and blues. 

“Okay; Jennifer said. “The physical stuff’s 
locked out, unless he’s friended you.” She 
glanced over her shoulder. 

Allison shook her head. “We didnt talk 
much about it.” In her mind, she was review- 
ing the few dismissive words they'd spoken 
on the subject, trying to determine whether 
they‘ established a tacit understanding that 
neither would look at the other’s profile. 
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A little knowledge ... 


Kevin had called it ‘a marginally more 
accurate horoscope’ but didn’t people 
sometimes read horoscopes just for fun? 

“The personality stuff’s more fun 
anyway, Jennifer said. “Okay, let’s see: 
H11Bf, that’s cool. He has a good sense 
of humour?” 

“Definitely” 

“But he has his serious side,” 
Jennifer said. Her hand and wrist 
were busy working the mouse. 
As the cursor slid across the 23 
pairs of rather cartoonish chro- 
mosomes on the screen, small 
sections lit up, boxes of text 
appeared beside them, and other sections, 
seemingly unrelated, glowed in sympathy. 
“And — Uh oh” 

“What?” 

“HOPPER9, B2F11, WELLER-W YMAN 
and no 17J-CROSSHAIRS. That’s got 
stubbornness written all over it. Have you 
noticed that?” 

“No. I mean, he didn’t want to see the 
movie I did. But I didn't care” 

“Well, you better get used to that. He'll be 
picking the movies.” 

Allison couldn't imagine a thing like that 
bothering her. Still, it was disconcerting to 
find a stain, however small, on her previ- 
ously spotless conception of Kevin. 

“And, look, Allie: NICKEL7. That’d 
be okay if he had JIB4, or the elongated 
DONALDSON-HARVEY, but, nope. That 
means highly confrontational. Highly.” 

“Huh,” Allison said, remembering the 
business of Kevin shushing the pair of obnox- 
ious teenagers whod been talking during the 
movie. But wasn't it admirable to stand up for 
ones self, when the situation dictated? Then 
again, the kids hadn't been so loud. 

“Oh, Allie: 76 UNION-Y-SAIL. Hostil- 
ity towards authority. Look, that’s linked 
to oppositional disorder ... all kinds of 
things... crime” 

Allison could come up with nothing to 
corroborate that, which somehow made 
it worse. She sounded unconvincing even 
to herself when she said, “But you cant tell 
how these things are going to express them- 
selves. Environment's just as important. 
Right? Free will? What's the saying? It’s a 
list of ingredients, not a recipe” 

“Yeah, but look at 
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When Allison came in, Jennifer set aside her 
magazine, and said, “So?” 

Allison’s stoical expression faltered before 
she reached the sofa. By the time she was 
seated beside Jennifer, her mouth was bent in 
a steep frown, and her eyes were welling up. 

“Tt’s over.” 

“Oh, Allie. You broke it off? It's probably 
for the best. Considering.” 

“He did” 

“What? Don’ tell me he didn’t like your 
profile. That’s some nerve, with —” 

“He never looked at my stupid profile. But 
he knew I looked at his. He said he could 
tell right away, I was treating him differ- 
ently. I could hear myself doing it, too, but I 
couldn't stop. It’s like, when you know those 
things about someone ... when you think 
you know...” 

“What was it, some kind of trap? Remem- 
ber, he has VIKING-F11? They list that as 
strategic thinking, but one of the corollaries 
is deceit?” 

“Tt’s a trap, but he didn’t set it. The whole 
thing’s a trap?” 

“Was he very... confrontational?” 

“Stop it, Jen. It wasn’t even him we were 
looking at.’ 

“We got the wrong profile?” 

“Tt was his profile, but not his genome. He 
posted a different one. One of those histori- 
cal reconstructions they do.” 

“What? Who?” 

“Gandhi” 

Jennifer stared. Finally she said, “Well, Id 
call that deceitful. I bet he has VIKING, at 
least.” 

Allison stared back for a moment, then 
stood. “I'm going to bed.” m 
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